Non-linear corrections to Lagrangians predicted by 
causal set theory: Flat space bosonic toy model 



Roman Sverdlov Institute of Mathematical Sciences , 
IV Cross Road, CIT Campus, Taramani, Chennai, 600 113, Tamil Nadu, India 

January 27, 2012 



Abstract 

&i A while ago a proposal have been made regarding Klein Gordon and Maxwell La- 

grangians for causal set theory. These Lagrangian densities are based on the statistical 
analysis of the behavior of field on a sample of points taken throughout some "small" 
region of spacetime. However, in order for that sample to be statistically reliable, 
a lower bound on the size of that region needs to be imposed. This results in "un- 
wanted contributions" from higher order derivatives to the Lagrangian density, as well 
as non-trivial curvature effects on the latter. It turns out that both gravitational and 
non-gravitational effects end up being highly non-linear. In the previous papers we 
were focused on leading order terms, which allowed us to neglect these nonlinearities. 
We would now like to go to the next order and investigate them. In the current paper 
we will exclusively focus on the effects of higher order derivatives in the flat-space toy 
model. The gravitational effects will be studied in another paper which is currently in 
preparation. Both papers are restricted to bosonic fields, although the issue probably 
generalizes to fermions once Grassmann numbers are dealt with in appropriate manner. 



1. Introduction 

A causal set, originally proposed by Rafael Sorkin, is a model of spacetime that replaces 
coordinate system with lightcone causal relations. After all, any geometrical information 
that we "know" about the spacetime we have actually "learned" by analyzing the signals 
that our eyes receive. These signals, in term, are constrained to propagate only between 
" causally related" pairs of points. In light of lack of circular causality, it is clear that causal 
relations form a partial ordering -<, and a signal can propagate from point a to point b if 
and only if a -< b. Thus, the information that we learn first hand is precisely the specific 
structure of that partial ordering. Later on, we "analyze" this structure and "infer" that 
there is some coordinate system that "created" it. The contention of causal set theory is 
to "unlearn" any information we might have inferred (including the coordinate system) and 
view the partial ordering as the one and only geometry available. Accordingly, some models 
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have been proposed that attempt to describe Lagrangians ([3] and [I]) and propagators ([I] 
and [2]) in a coordinate-independent fashion. 

In principle, there are discrete structures for which such procedure is straightforward. 
For example, if we assume cubic lattice, the "nearest causal neighbor" will identify a diagonal 
of any given " cube" . It is relatively easy to count such " diagonals" in order to reconstruct 
the information we had from coordinate system. At the same time, however, cubic structure 
would lead to "preferred directions" (such as edges of the cubes). In order to make the 
structure more covariant-looking, we need to replace cubic structure with Poisson distribu- 
tion of point. However, in case of Poisson distribution, we expect a lot of random variation 
of distances between neighboring points which makes them no longer reliable. In order to 
"recover" continuum quantities we are forced to look at "statistically large" sample of points, 
so that by the law of large numbers the random fluctuations cancel out. That sample of 
points, of course, is assumed to lie within a "very small" region of spacetime, so that the 
linear approximation still holds to high accuracy, thus allowing us to compute Lagrangian 
densities. At the same time, the size of this region is assumed to be several magnitudes 
larger than the discretization scale. 

The fact that the above- described region is finite leads to some unwanted effects. In 
particular, in order to compute Lagrangian density, which is defined in terms of derivatives, 
we need some kind of statistical analysis of the behavior of relevant fields over the "sample of 
points" . The statistical analysis has to be designed in such a way that the main contribution 
comes from the first derivatives of the fields involved. However, the input of the analysis 
consists of the values of the fields at various sample points. Due to the "finite" size of the 
region, the latter includes higher derivatives. If we were to take direct neighbors (as we would 
have done in the case of cubic lattice) we would be able to dismiss higher order derivatives by 
simply saying that they are not well defined since there is nothing "between" the neighbors. 
In causal set context, however, we are taking a "large sample of points" which means that 
we are no longer able to dismiss higher order derivatives in the above way. 

In case of curved spacetime, we will also have curvature effects in addition to the above. 
Again, if we use some regular lattice to model the curved spacetime, we can argue that cur- 
vature is not defined "between" the neighboring points and, therefore, is of no consequence. 
In case of causal set, however, the lack of regular structure forces us to take statistically large 
sample of points. This ultimately implies that the curvature will in fact have some finite 
effect within that sample. At the same time, we are using that finite sample as a discretized 
"infinitesimal" region where we are supposed to "take derivatives". Thus, curvature is "not 
supposed" to have any effect on that region; yet it will. Intuitively, this means that the very 
"tangent plane" is now "curved" . This, of course, is logically separate from the effects of cur- 
vature we would expect in "usual" cases. Therefore, the continuum limit of resulting effects 
might be very different as well, and one needs to perform explicit causal set calculations to 
find out what these effects might be. 

In the previous papers, we were focused on leading order terms which allowed us to ne- 
glect both the contributions of higher order derivatives as well as contributions of curvature. 
We would now like to consider higher order terms and analyze these contributions. This 
attempt is partly motivated by the results of recent OPERA experiment regarding neutrino 
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speed. It is conceivable that the non-linear interactions of a "very weak" field (neutrino) with 
"much larger" background field (earth's gravity as well as its magnetic field, the atmosphere, 
and so forth) might result in the latter setting a "preferred frame" for the propagation of the 
former. In that "preferred frame" the "weaker fields" can, conceivably, be either subluminal 
or superluminal. 

However, no attempt to compute the deviation of speed of propagation was made in 
this paper. Furthermore, the very existence of such "speed" is highly questionable. In fact, 
non-linearities imply that we can no longer appeal to the concept of eigenf unction. If we 
"insist" in doing Fourier decomposition of our solutions, we will likely conclude that any given 
Fourier component "gives birth" to a number of other components moving with different, 
and seemingly unrelated, velocities. One can attempt to argue that the rate of this process 
is very small and, therefore, negligible; at the same time, one can also credit the "very large" 
fields of the earth as an exclusive reason that the non-linear interaction with the latter can 
not be similarly neglected. Again, however, in order to make such claims one needs to do a 
large amount of research that has not been done in this paper. 

The exclusive goal of this paper is simply to explore the non-linear effects that the 
causal set theory will predict, independently of OPERA experiment or any other "outside" 
motivations we might have. One reason we are independently interested in non-linearities 
is that it is perhaps the only tangible way of using causal set theory for the purposes of 
making new predictions (whatever these "new predictions" might "happen" to be). As we 
have stated earlier, causal set theory does not assume any regular structure. This makes it 
very difficult to make analytic calculations without going back to the continuum limit. Thus, 
in order for causal set theory to make new predictions, one has to think of some deviations 
from "traditional" theories that persist in the continuum as well. Non-linear effects we 
just described meet this criteria. On a down side, however, such effects might imply that 
the resulting theories are no longer renormalizable. In principle, we can still attempt to 
compute non-renormalizable theories since the discreteness scale sets up a "preferred" value 
of ultraviolet cutoff. Again, however, the fruitfulness of such calculations is questionable. 

In this paper we will restrict ourselves to finding the non-linear effects on Lagrangian 
density, without actually "using" that Lagrangian density for any calculations. We do, 
however, plan to return to the above issues in future work. Apart from this, the paper at 
hand is restricted to the flat space toy model. In other words, we are exclusively focused 
on the effects of higher order derivatives of Klein Gordon and Maxwell fields in flat space 
context. The effects of curvature will be studied in [TJ, and the generalization to fermions will 
be studied in j8] , which is likewise in preparation. Again, however, both of these papers are, 
likewise, restricted exclusively to Lagrangian densities and they make no attempt to compute 
any Feynmann diagrams or tackle any specific problem (such as OPERA experiment). But 
hopefully these papers might give some direction to either the author, or other readers of 
this paper, to work in future. 
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2. Geometry and fields on a causal set: a brief review 

We consider a partially ordered set (S, -<s), where S is a discrete set. The main reason we 
write -<s rather than simply -< is to distinguish it from -<m which is to be defined on a 
smooth manifold A^.The relation -< s satisfies the axioms of partial ordering: if a -<s b and 
b -<s c then a -<g c, and there is no point a satisfying a -< a. The relation -< is called causal 
relation. Physically, a -< b holds if and only if we can travel from point a to point b without 
going faster than the speed of light. The transitivity of causal relation follows from the fact 
that we can travel from a to c by "first" traveling from a to b and "then" traveling from b 
to c. The fact that none of the points satisfy a -< a is equivalent to the statement that there 
are no closed causal loops. If, for example, we could travel from a to b and then come back 
to a, this would mean that a -< b and b -< a both hold; by transitivity, this would imply 
a -< a, which is "forbidden". 

Two points a and b are said to be direct neighbors if they are causally related and there 
is no other point that is causally "between them". If a b holds, then a -<* s b holds if and 
only if a and b are direct neighbors: 

d4l)^((fl^6)A ^c(ax s c^5)) (1) 

A fundamental structure of our theory is Alexandrov set, a(p, q), which consists of all points 
r satisfying p ^g r <!'■ 

<*s(p, q) = i r \p r q} (2) 

where is defined by 

a^sb^ [(a ^ s b) V (a = b)} (3) 

Geometrically, this looks like a compact region of spacetime bounded by two light cones: 
"future" light cone of p and "past" light cone of q. This set defines a "preferred frame"; 
namely, the "local" t-axis coincides with the geodesic passing from p to q. It is obvious that 
p and q are "direct neighbors" if and only if they happen to be the only elements of the 
Alexandrov set that they form: 

i( a b) A (a s (a, b) = {a, b})] (4) 

In general, however, we would be interested in Alexandrov sets that are non-empty and, in 
fact, contain large enough number of points to be statistically relevant. 

We observe that in case of Minkowski space the distance between two timelike separated 
points is equal to the length of the "longest path" connecting them (which, in flat case, 
happens to be straight line). The fact that that path is the "longest" rather than the 
"shortest" is due to the minus signs in Minkowskian metric. Therefore, we will define a 
discretized distance on a causal set to be the length of the "longest" possible chain of points 
p -< r\ < • • • < r n -i -< q, where the "length" of the above chain is identified with n: 

Ts(p,q) = f max{n|3ri, • • • ,r n _i G S : p -<<? n -< < s r n _ x -< q}, (5) 

It is commonly assumed that £ coincides with Plank scale; but in the opinion of the author 
such doesn't have to be the case. It should be noticed that the two points are "direct 
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neighbors" if and only if the distance between them is £: 

a <* s b ^ [(a ^ s b) A (r s (a, b) = £)] (6) 

A sequence of points is said to be a geodesic if the " longest" path connecting any of its points 
happen to be the segment of that sequence itself: 

{• • • , a_„_i, a_ n , • • • ,a m , a m+1 , • • • } is geodesic 

VA; < I flb u ■ ■ • , &z_ fc (a fc -< s 61 -< s < 5 b t _ k -< a,) (7) 

It should be noticed that the path a k <s &i -<s ' ' ' ~<s fy-fc-i ~<s °z is "allowed" and, at 
the same time, it has "the same" length as the "segment" in question. The only thing we 
claim is that there is no path "longer" than the latter. This choice is crucial since we don't 
want the presence of two same- length paths to prevent us from drawing a "geodesic" . At the 
same time, in order to preserve existence, we sacrifice uniqueness (although we could restore 
uniqueness by "taking a union" of all possible geodesies). 

Since we are planning to routinely compare the causal set S to a manifold A4, it is 
important to formally define the corresponding notions in M.. The timelike curve on M. is 
defined in a coordinate based way: 

7: R ->■ M is timelike curve Vtf gauhit)) ^ ^? > o) (8) 

V at at / 

The causal relation on a manifold M. is -<m (while causal relation on a causal set S is -<s)- 
The relation x^ -< y^ holds if and only if x^ and are connected by at least one timelike 
curve: 

x< M y 3 7 : [0,l]^M(f(0)=x*, 7 M (l) = y M ; ^(7(r))^^ = l,) (9) 

If 7 : R — > M. is timelike, then its length is given by 



r M (i;n,r 2 ) = j ^ g^(r) ^ ^ dr (10) 

The distance between i^gM and ^ G is the length of the longest possible timelike, 
future-directed, curve that connects the two points: 

r(s",y") = sup{r(7;r 1 ,r 2 )| 7 (r 1 ) = z"; 7 fa) = V"; ^ > ^9^)^^- > o} (11) 
Finally, we will define an Alexandrov set on M. to be 

a M (x»,yn = {z» e M\x» < M z» < M y**} (12) 

If M. is a Lorentzian manifold, then the function / : S — > M. is said to be an embeding 
of S into M if it respects causal structure. In other words, 

P^sq^ f(p) <M f(q) (13) 
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The set S is said to approximate Ai (which we write as S ~ Ai if f{S) "looks like" a 
Poisson scattering on Ai with density so large that any "small" region of Ai has "very 
large" number of points. Such set can be produced by "starting" from Ai, performing the 
Poisson process on Ai to obtain a set T C Ai, and finally identifying the set T with f(S) 
for some other "abstract" set S. Then the causal relation -<s can be "read off" from -<m 
together with Equation [TjJJ Such sets have been widely studies and it has been confirmed 
that distances, indeed, approximate what we would expect them to be once the number of 
points is statistically large: 



where d is the dimensionality of Ai, and kd is a coefficient appropriate for that specific 
dimensionality. If we stick to one specific dimensionality it is possible to set kd to 1 by 
appropriately scaling £. 

However, if one insists on viewing as fundamental rather than -<Mi then one is "not 
allowed" to "start off" from Ai. Instead, one is hard pressed to formulate S'-based "physics 
laws" that would "force" S to approximate Ai on a sufficiently large scale. This is one of the 
big problems of causal set theory that is still unresolved. In principle, one can try to do a 
"short cut" by saying that the "physics law" is simply a constraint that "forbid" any S that 
would not "approximate" at least one sufficiently smooth manifold. But then the question is 
how can one rigorously formulate such a constraint? Since S is fundamental rather than Ai, 
that would force us to find an intrinsic property of S through which we can judge whether 
or not S is "manifold-like". In principle, there should be such property: after all, we have 
"learned" that our universe is manifoldlike by "analyzing" the set of rays that hit our eyes 
(in other words, causal relations). At the same time, at least so far, the attempts to specify 
such property had not been successful. Furthermore, even if one does it successfully, it would 
be "pointless" if the constructions involved are too complicated. After all, they would be 
even "less" natural than the coordinate system we were trying to get rid of. 

For the purposes of this paper, we will not attempt to address the questions in the above 
paragraph. Instead, we will simply assume that S ~ Ai, without specifying the reason. Our 
only purpose is to work out the Lagrangian densities on already-given causal set, while 
leaving the question of its origin aside. 

Let us now proceed to define sample fields on a causal set as well as their Lagrangians. 
For the purposes of this paper, we will limit ourselves to Klein Gordon and electromagnetic 
fields, which we will define as S — > C and ag: S x S — > R, respectively. It is further 
assumed that they correspond to well behaved differential functions <Pm : Ai — > C and 
flM:MxM->MonM: 



The "scalar field" on a manifold Ai is identified with <pM itself, while the electromagnetic 
field is identified with A: Ai — > TAi (where TAi is a "tangent bundle" of Ai) and it is 
assumed that 




(14) 



4>M(f(p)) = <f>s(p) ; aM(f(p),f(q)) = as{p,q) 



(15) 




(16) 
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where j(x, y) is a geodesic connecting x and y. The geodesic 7 is defined in terms of ordinary 
manifold-based way, 

drf dx^d^_ 

dr v dr dr 1 ' 

as opposed to Equation [7J After all, Equation [7J is a definition of geodesies on a discrete 
set S, whereas Equation [T71 continues to be the definition in continuum case of M.. As we 
said previously, this paper is focused on flat space, while curvature effects are postponed to 
[7J. The reason we are including Christoffel's symbols is simply because it might benefit the 
reader who will be interested to read other papers, and it doesn't require a lot of extra labor 
on our part. For the purposes of this paper we can assume that rig = 0. 

In the above expression, we have written A 11 instead of A M because A^ 1 does not have S- 
counterpart. On a manifold A4, we define a Lagrangian density Cm : {4>m} x {^} x -M — > R 
in a usual way, 

£(0, A»- x?) = F^F^ + V^V^ (18) 

where 

Our goal is to define a corresponding Lagrangian density £5: {<ps} x { a s} x S — > R such 
that 

£, s {<t>s,a s ;p) * £M{<t>M,A»;f(p)) (20) 

We are approaching that goal by focusing our attention on f(S), but, at the same time, 
attempt to re-express Cm in such a way that it does not appeal to anything containing 
Lorentzian indexes or anything else that lacks S-counterpart. This will allow us to "rewrite" 
our final answer in terms of S rather than f(S). 

Let us now go ahead and define £5, in a way that meets the criteria of Equation [201 We 
will define a small scale r. This scale should be "small enough" in order for the well behaved 
fields to be approximately linear (r 1) but, at the same time, it should be "large enough" 
to contain statistically reliable sample of points (r ^> £); thus, 

£ « r « 1 (21) 

The criteria of "successful" definition of £5 is that it approximates Cm on "our" scale; that 
is, 

£ 5 = £^(l + 0(r)) (22) 

Producing such Cs is the goal of the current chapter. Then, in the next chapter, we will 
"take" £5 produced in this chapter without changing it at all, and evaluate higher order 
terms it would produce. We will find that 

C S = C M + 6C M + 0(r 3 ) (23) 

The "small variation" 5 Cm is the ultimate goal of this paper. As explained in the intro- 
duction, this goal is motivated by a possibility that 5Cm might be responsible (or at least 
contribute to) the deviation of the speed of neutrino from the speed of light (although, as 
far as this paper is concerned, we are not considering the neutrino field). 
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3. First order Lagrangian density for scalar fields 



One proposal of Cs has been made in [3J. However, at the time of writing of that paper, 
we were not interested in finding 5 Cm- As a result, we made some "sloppy" moves that 
would make 5Cm much larger than it should be. The main example of this sort of thing is 
that we assumed that the "neighborhood" of a given point lies "to the future" of that point, 
instead of assuming that the point is at the center of the neighborhood. In the former case, 
we obtain 

C s = C M + 0(r) = C M + 5C M + 0(r 2 ) ; 5C M = 0(r) (24) 
while at the latter case we obtain 

C s = C-m + 0(r 2 ) = C M + 5C M + 0(r 3 ) ; 5C M = 0(r 2 ) (25) 

While the 6 Cm = 0(r) would still be "formally okay" , it is clear that the option that produces 
5 Cm — 0(t 2 ) is by far more reasonable. Thus, in this section we will redo [3] while moving 
the point in question to the center of the neighborhood, and then in the next section we will 
proceed to finding the 0(r 2 ) correction. 

According to our model, the field has two Lagrangians rather than one. In particular, the 
Lagrangians are C t and C s (where letters "t" and "s" stand for "timelike" and "spacelike") 
defined as 



£ s (0;r) = -(cP^X-l + sgn(d^d^)) (26) 



2 

where "sgn" is a "signum" function defined as 



C t (<f>;r) = Ud^d^)(l + sgn(d^d^)) (27) 



-1 


If 


X 


< 





if 


X 


= 


+1 


if 


X 


> 



sgn(x) = < if x = (28) 
{ +1 if x > 

and we are using (+, — , — , — ) metric convention. Thus, the "spacelike" Lagrangian returns 
zero if the gradient of (f> is timelike at a given point and it returns the Lagrangian density 
"with the wrong sign" if the gradient of <fi is spacelike. On the other hand, timelike La- 
grangian returns zero if gradient is spacelike, and it returns Lagrangian density with the 
correct sign if the gradient is timelike. After we have obtained C s and Ct, we will then define 
C by superimposing the two: 

C = C t - C s (29) 

where the minus sign is meant to "correct" the "sign error" inside of C s . It is easy to see 
that substitution of C t and C s into the above equation results in 

C = d^d^ (30) 

as expected. Now, each C s and C t need to be defined for arbitrary causal set. Since a general 
causal set is not manifold-like, we are not allowed to refer to coordinate system the way we 
do when we write partial derivatives. At the same time, however, we will have to show that 



8 



in the special case of manifold-like causal set the "general" definition reduces to coordinate- 
based one. Therefore, we will propose the following construction. We will define a Lagrangian 
generator to be 

/C(0; Sl , S2 ) = (0(s 2 )-0( Sl )) 2 (31) 
We will then define a Lagrangian generators J s and J t to be 

Js{<f>; P, 0) = max{(0(si) - <p(s 2 )) 2 \p ^* St ^* q , p ^* s 2 ^* q} (32) 

Jt(<t>;P,q) = max{(0(si) - 0(s 2 )) 2 |si = p , s 2 = q} (33) 
Thus, of course, J t simplifies to 

Jt(<P;p,q) = (<P(q)-<P(p)) 2 (34) 

while J^ s ((f)',p,q) would require a little bit more work. Finally, we will define Lagrangian 
densities at r as 

C s (<f); r) = min{ J s (p, q)\p -< r -< q , r(p, r) = r(r, g) = r} (35) 

C t {4>\ r) = min{ J t (p, q)\p -< r -< q , r(p, r) = r(r, q) = r} (36) 

Then after the Lagrangians have been defined by the above method they are manually 
subtracted: 

C = C t -C s (37) 

Strictly speaking, the definition of Lagrangian is given by Equations [3U [3H [321 [36J [35] and 
1371 None of these equations have any reference to coordinate system; thus, the Lagrangian 
is defined for abstract causal set. 

Let us now show that in a special case where the causal set happens to be manifoldlike 
(defined in terms of embedding /: S — > A4, we would, in fact, obtain Equations I2"61 and I2T1 
Geometrically, the fact that p -<* S\ implies that the Lorentzian distance between f(p) and 
f(si) is 0. In other words, f{si) lies in the light cone of f(p)- The fact that we have p -<* S\ 
as opposed to si -<* p means that f(s±) lies in the future light cone of f{p). Similarly, the 
fact that s\ -<* q implise that S\ lies in the past light cone of q. Geometrically, it is easy 
to see that these two conditions imply that s\ lies on the surface of the "equator" of the 
Alexandrov set a(p, q). In other words, if we define our coordinate system in such a way 
that 

f(p) = (-T, 0,0,0), f(q) = (r, 0,0,0) (38) 
then the t-coordinate of si is zero, 

si = (0, £1,1/1,2:1) (39) 

and its space coordinates satisfy 

x\ + yl + zl = r 2 (40) 

The same, of course, is true for s 2 : 

S2 = (0,x 2 ,y2,z 2 ) , x\ + yl + z\ = t 2 (41) 
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Let us now consider the situation in which the gradient of is spacelike. We can select a 
coordinate system in which the spacelike part of gradient of points in z-direction. Thus, 



d X <f) = d y <p = (42) 

Now, up to linear order, the Lagrangian generator is 

JC s (<f>;p,q,s h s 2 ) = ((s^-sft)d^f (43) 

which, in our coordinate system, becomes 

P. 9, si, s 2 ) = ((z 2 - z^d^f (44) 

Therefore, it is being maximized by the choice of points si and s 2 that lie on z-axis. The 
maximization of "Lagrangian generator" K. is identified with "pre-Lagrangian" J: 

d x <P = d y <j> = J = Ar 2 d z( f> 2 (45) 

This generalizes to rotationally covariant (but not Lorentz covariant) expression 

i 7 = 4r 2 |V0| 2 (46) 

Now, we would like to "minimize" J. Since the gradient of 4>m is spacelike, there is a frame 
in which its time component is zero. It is easy to see that if we will first start from that 
frame and then perform Lorentz boost away from that frame, then |V0| will increase. This 
means that the frame in which the time component is zero is the one where minimization of 
J occurs. The minimum of J is identified with C. Thus, 

«9 o = O^£ s =4r 2 |V0| 2 (47) 

This generalizes to a Lorentz covariant expression 

C s = -4r 2 ^09 M (48) 

where the minus sign comes from (+, — , — , — ) metric convention. The above was derived for 
the case where the gradient is spacelike. Now, if the gradient happens to be timelike then 
Cs is still formally defined. This time, however, it is equal to zero. After all, if the gradient 
is timelike, we can select Alexandrov set in such a way that d^cf) points in a direction parallel 
to — p M and, therefore, perpendicular to equator. This would imply that for arbitrary 
s\ and s 2 lying on the equator, (f)(s2) — 0(si) = 0. Thus, for arbitrary s\ and s 2 on the 
equator JC s (<ft] S\, s 2 ) = 0. This means that "maximum" over "all possible" K, s "throughout 
equator" is as well. Thus, Js is zero. Now, since all of the expressions involve squaring, 
none of them are negative. Therefore, the fact that Js is zero "at least once" means that 
the minimum of Js is zero as well. Finally, since C s is identified with a minimum of Js, this 
implies that 

0^0>O=^£ s (0;r) = (49) 
Finally, putting together Equation HH] and H9] gives us 

C s = 2r 2 (dyd ll <f ) )(-l + sgnidyd^) (50) 
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Let us now move on to C t . Again, we have to do it for two cases: the gradient of being 
timelike and spacelike (and in the former case we will get well known Lagrangian density 
while in the latter case we will get zero). Let us start from the case of d^cj) being timelike. 
By definition, the points s\ and S2 are "constrained" to coincide with p and q respectively. 
Thus, the "maximum" over s\ and S2 trivially reduces to the corresponding expression over 
p and q. Thus, 

J t (fr p, q) = K t (<j>; p, q) = (<P(q) - <P(p)) 2 (51) 
If we assume that is linear, this becomes 

W;p,?) = ((/ /i (g)-/ M (p))M a ( 52 ) 

We will choose coordinate system in such a way that the gradient points along t-axis. In 
this case the above expression becomes 

Jt(<P;P,q) = (f (q)-f°(p)) 2 (d <j ) ) 2 (53) 

In order to minimize J t , we have to minimize f°(q) — f°(p)- Since the constraint of min- 
imization is r(p,r) = r(r,q) = r, it is easy to see that the minimum is achieved when 
/ M (g) — f^(p) is parallel to i-axis; that is, 

F(q) - F(r) = F(r) - F(p) = (54) 

We now substitute the above into J t and identify the result with C t : 

C t = 4r 2 (80(f)) 2 (55) 



Since the above equation was derived in a frame where dk<p — 0, this generalizes to a Lorentz 
covariant expression 

C t = Ar 2 dyd^(p (56) 

Let us now assume that d^cj) is spacelike. In this case, it is possible to select Alexandrov set 
in such a way that its axes point perpendicularly to the gradient of <fi. This will immediately 
imply that K t = and J t = 0. Since JC t and J t involve only quadratic expressions, they are 
both non-negative. Thus, the fact that Jt coincides with zero at least once implies that the 
"minimum" of Jt is zero. Now, since the minimum of Jt is identified with £ 4 , this implies 
that 

3^0 M 0<O=S-A(0;r) = O (57) 
Puting together Equations [56] and [571 we obtain 

C t = 2r 2 (9>^0)(l + sgnid^d^) (58) 

Finally, combining the Equations [50] and [58] we obtain 

C = C t -C s = 2r 2 ^0^0 (59) 

The moral of the story is that, due to the lack of reference to coordinates in the original 
definitions of Lagrangians, we are bound to obtain relativistically covariant result. At the 
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same time, however, we can have "relativistically covariant" things that are "not observed 
in nature", such as sgn(<9 M 0<9 M 0). In order to "get rid" of these things we need to find 
"by hand" an appropriate linear combination of Lagrangians that would cancel unwanted 
quantities. Similar situation will arise in electromagnetic case. We will have two different 
Lagrangians: one is "electric" and the other is "magnetic" . Both will be expressed in Lorentz 
covariant form. But we would have "unwanted" contraction ea^sF^F^ 6 . That contraction, 
however, would get canceled when we find appropriate linear combination of "electric" and 
"magnetic" Lagrangians. 



4. Higher order correction for scalar field 

Let us now compute higher order corrections to the scalar field Lagrangian. In the previous 
section we have found out that the Lagrangian is of 0(r 2 ). Let us now agree to the next 
order we are looking at. Let us denote by 4>m a "linear" approximation and by <p' M its 
modification. If /(si) and f(s 2 ) maximizes (4>M-(f(s 2 )) — 0a^(/(si))) 2 , the points f(s[) and 
f(s' 2 ) maximize (0^vi(/(s' 2 )) — Then the "corrected" expression will take the form 

- = lMf(s 2 )) - <p M (f(si))] + M M (f(s 2 )) - <p M (f(s2))]- 

- W M {JM) ~ Mf(si))} + W M (f(s' 2 )) - <P' M (f(s 2 ))] - MmWi)) - 4t M iS(*i))\ ( 6 °) 

Now, up to the first order, ip M and ip' M agree. Therefore, when we look at ip' M (s) — ?Pm{ s ), 
we are referring to the second order or higher. Now, we recall from previous section that the 
points f(si) and f(s 2 ) are "exactly opposite" to each other relative to the center: 

f(si) = -f(s 2 ) (61) 

This statement is no longer true for f(s' 1 ) and f(s' 2 ), but it is still true for f(s\) and f(s 2 ). 
This means that the even order terms in 0^i(/(s2)) — <Am(/( s 2)) and $vt(/(si)) — <Am(/( s i)) 
will match. Since in the above expression one is being subtracted from the other, they will 
cancel. Therefore, the contributions from second and third term on the right hand side will 
come from 0(r 3 ) as opposed to 0(r 2 ): 



'M 



(f(s 2 )) - Mffr))] - [0m(M)) - Mf(sx))] = 0(r 3 ) (62) 



As far as the last two terms of Equation [60] are concerned, the symmetry arguments no longer 
apply since the displacement between we know that displacement betweein "primed" and 
"unprimed" points broke that symmetry. Now, the displacement that we have just mentioned 
is "much smaller" than the size of the Alexandrov set. Since the size of Alexandrov set is 
of 0(r), the displacement is of 0(r 2 ). Therefore if the derivatives were "finite" this would 
have, in fact, produced 0(r 2 ) difference. However, we recall from the previous section that 
we have chosen a coordinate system in such a way that 



dx 



dy 



(63) 



Since primed and un-primed points are located in the equator of the Alexandrov set rather 
than the origin, d x 4>M and d y (pM is 110 longer zero at these points. However, due to the fact 
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that their displacement from the origin is r, we know that the x- and y- derivatives are of 
0(t) as well: 



dx 



0(r) 



>M 



dy 



0(r) 



(64) 



As a result, the effect of these derivatives upon 0(r 2 ) displacement between primed and un- 
primed points leads to the field difference of the order of 0(r) x 0(r 2 ) = 0(r 3 ). On the other 
hand, if we consider z- derivative, then it is, in fact, "finite": 



'■>M 



dz 



0(1) 



(65) 



After all, the "smallness" of x- and ^/-derivatives comes from the assumption that they are 
zero at the origin; we made no such assumption regarding ^-derivative. At the same time, 
however, the ^-displacement of "primed" points relative to un-primed is of 0(r 4 ) rather than 
0(r 3 ). This is due to the fact that, as was explained in the previous section, the un-primed 
points lie on z-axis. After all, constraint that all of the points are "on the surface" of the 
equator thus implies that 



z' = \J V 2 — x 2 — y 2 = t\I 1 



x 2 + y z 



T- 



Now, since we already agree that 



we also know that 



Therefore 



x = 0(r 2 ) , y = 0(r 2 ), 



x 2 + y 2 



0(r 2 



r\ 1- 



x 2 + y 2 



T 1-r 



1 x 2 + y 1 



(66) 

(67) 
(68) 

(69) 

(70) 
(71) 

At the same time, as mentioned earlier, the ^-derivative is of 0(1). Thus, the contribution 
of derivative is 0(1) x 0(r 3 ) = 0(r 3 ). This means that all three derivatives contribute 
0(r 3 ) towards the last two terms of Equation [60J, even though the 0(r 3 ) arises for different 
reasons (in case of x and y it arises as 0(r) x 0(r 2 ) = 0(r 3 ) and in case of z it arises as 
0(1) x 0(r 3 ) = 0(r 3 )): 



+ 0(r 4 ) =r(l + 0(r 2 )) = r + 0(r 3 



Thus, the assumption that 
implies that 



Z = T 



z = 0(r 3 



dx 



6x + lf 
/(si) ay 



/(si) V + dz 



5z + 0((Sx) 2 ) 



0(r) x 0(r 2 ) + 0(r) x 0(r 2 ) + 0(1) x 0(r 3 ) + 0(r 4 ) = 0(r 3 ) 



(72) 
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Now, if is being modified by 0(r 3 ), then the kinetic term of the Lagrangian will be modified 
by 0(r 4 ). After all, if we alter we alter 



(j>(s 2 ) -> 0(s 2 ) + 5(f) 



(73) 



the Lagrangian generator is being altered according to 

(0(s 2 ) - 0( Sl )) 2 (0( S2 ) - 0( Sl )) 2 + (0(s 2 ) - 



The fact that 



(s 2 ) - = 0(r) , 



Ofr 3 



(74) 



(75) 



implies that 

(0(s 2 ) - 0(si))50 = 0(r) x 0(r 3 ) = 0(r 4 ) (76) 

It should be emphasized though that 0(r 4 ) corrections towards the Lagrangian density were 
produced from 0(r 3 ) corrections to scalar field. This means that 0(r 4 ) corrections to scalar 
field will have an effect of 0(r 5 ) or higher. Thus, we only need to compute the scalar field 
up to 0(r 3 ). The expression for up to that order is 



MiV)) = *(/(*)) + ^ 



d</> 
x + «~ 

/(«) 9y 



/( S ) y dz 



T 



— a/t 2 — x 2 — 



+ 



\d 2 4> 



2dx 2 



x 



ld 2 (f) 



d 2 (j) 



dxdy 



/(«) 



(77) 



f(s) 2 9y 2 

In the above expression, we didn't have d 2 (f>/dxdz, d 2 (f>/dydz and d 2 (f>/dz 2 . The reason is 
that the coefficient in front of d(f)/dz is of 0(r 3 ); thus we would expect that the coefficients 
next to higher order derivatives that happen to include z would be of 0(r 4 ) or higher. Since, 
as mentioned before, we are computing up to 0(r 3 ), we can throw these terms away. 

Now, in order to find the displacement of point s' relative to s, we have to find the ex- 
tremum of </>x(/(s')) given above. As usual, we find the extremum by taking the derivatives 
with respect to x and y and equating them with zero (we don't need to do the derivative 
with respect to z since we have re-expressed z as a function of x and y). Thus, 



dx 

dcj) 
dy 



x 



/(») v/r 2 - x 2 - y 2 dz 



f( s ) dx 



d 2 4> d 2 <p 
f( s ) dxdy 



y + (r 2 ) = (78) 



V 



f(s) y/l 



x" 



,2dz 



d 2 <t> 

H - 

f(s) dy 2 



d 2 4> 

f(s) dxdy 



x + 0(r 2 ) = (79) 



The above derivatives are computed up to 0(r 2 ) rather than 0(r 3 ) because the derivative of 
0(t 3 ) is of 0(t 2 ). This, however, does not change the fact that the values of extreme points 
of two functions differing by 0(r 3 ) from each other still differ by 0(r 3 ) (one can construct 
a proof to this effect even without reference to derivatives). Therefore, we trust ourselves 
that if we start off from functions defined up to 0(r 3 ) and follow our noses, we will obtain 
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an 0(t 3 ) accuracy at the end, even if we will be "running into" 0(r 2 ) along the way. Now, if 
we multiply the above expressions by r, we obtain 



dx 



dy 



x 



x 2 +y 



2 dz 



x 2 + v 2 dz 



d 2 (f) 

H - 

f( s ) dx 2 



d 2 (f> 

H - 

/(») <% 2 



<9 2 



<9 2 

/(s) T 0X0?/ 



/(«) 



yr + 0(r 3 ) = 



xt + 0(r 3 ) = 



50) 



(81) 



In light of the fact that x and y are of 0(r 2 ), we know that 



x 2 + y 2 



T 



0(r 2 ) 



(82) 



This means that the second terms in the above two expressions are equal to x + 0(r 3 ) and 
y + 0(r 3 ), respectively. Furthermore, again from the fact that x and y are of 0(r 2 ), we know 
that xt and yr are of 0(t 3 ). In other words, the last two terms in Equations I8"U1 and I8T1 are 
of 0(t 3 ). But, as we mentioned earlier, the increment of would be a product of the x- and 
y- displacements by d x <p and y which, itself, is of 0(r). Thus, in order to know (ft up to 
0(r 3 ) we only need to know x and y up to 0(r 2 ). Therefore, we throw away all of the r 3 
terms thus simplifying the above expressions: 



dx 



T 



dy 



/(«) 



x 



y 



dz 



dz 



/(«) 



0(r 3 



+ 0(r 3 











(83) 
(84) 



Now, since the x- and y- derivatives at the origin are exactly zero, their values at f(s) are 
given by 

d(f) d 2 4> d(f) d 2 4> 

dx f(s) dxdz o ' dy /(s) dydz o 

The above was computed to 0(r) because in Equations E3] and [84] these derivatives are 
multiplied by r which would turn 0(r) into 0(r 2 ), and 0(r 2 ) is the order up to which we are 
doing our calculation. By substituting Equation [85] into Equations [83] and [84] we obtain 



d 2 



T 



dxdz 
d 2 ch 



dydz 







dz 


m 


d(f) 
^ dz 


/(«) 



+ 0(r 3 ) 
+ 0(r 3 ) 











56) 
57) 



Now, the only reasons dcj)/dx and d<fi/dy are "small" is that coordinate system is chosen in 
such a way that the gradient of <fi is parallel to z-axis at the origin. This implies that d<fi/dz 
is large. Thereore, we immediately obtain the expression up to 0(r 2 ) for x and y: 



2 d 2 <P/dxdz 3 

X 1 J 



_ 2 d 2 <p/d y dz 3 
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where we have dropped the indications of points at which the partial derivatives are being 
evaluated since that would lead to 0(r) x 0(r 2 ) = 0(r 3 ) effect which we are ignoring. We 
can now use these x and y to compute z: 



x 2 — y 2 



(d 2 cj)/dxdz\ 2 /d 2 4>/dydz\ 2 \ 
\ d<f)/dz ) + \ d<p/dz ) ) 



rWl-r 2 



T 

~2 



d 2 4>/dxdz\ 2 
d<j)/dz ) 



d 2 cf)/dydz 



(89) 



d(f)/dz 

Thus, we would like to compute the value of ^^(s^), up to 0(r 3 ). Since the deviation of 
z is of 0(r 3 ), we only need the first derivative with respect to z up to finite order. On the 
other hand, since the deviations of x and y are of 0(r 2 ), we need to know respective first 
derivatives up to 0(r). But, as stated earlier, the x- and y- derivatives are of 0(r) to begin 
with! Thus, again, we only need a leading order terms of first derivatives, just for a different 
reason. Therefore, we don't need any second derivatives at all. Thus, we use 



<f>M(f(s')) - (j>M(f(s)) = x- 



dx 



/(«) dy 



/(«) 



+ {z-r)- 



dz 



0(r 4 



(90) 



By substituting the Equations EE] and EH] into the above, we obtain 



MM)~MM) 



, d 2 (j)M/dxdz dcj)M 



d(j>M 1 9z dx 



/(») 



+ r 



,d 2 4> M /dydz d(j) M 



d(j) M /dz dy 



+ 



d 2 4> M /dxdz\ 2 fd 2 (j) M /dydz\ 2 \d(j) M 



/dz ) 



+ 



dcf> M /dz 



dz 



f{») 



+ 0(r 4 



(91) 



Now, by substituting Equation E3 into the d/dx and d/dy terms above, we obtain 

MM) - MM) = T v + r 



\d 2 cj) M /dxdz d 2 (p M z d 2 <p M /dydzd 2 i 



d<p M /dz dxdz 



f(s) 



T 
~2 



M /dz dydz 
+ 0(r 4 ) 



+ 



/(*) 



/(*) 



(92) 



~ :! " / d 2 4> M /dxdz\ 2 /d 2 (p M /dydz\ 2 \ d<pM 
- d(j)M./dz ) V d(pM/dz J J dz 

Now, from the inspection of the above equation, one can see that the first and second terms 
match the third and fourth term, respectively, except for the factor of —1/2 that the latter 
two terms are multiplied by. Therefore, the above expression simplifies to 



M/OO) - Mf(s)) 



t 3 (d 2 4>M/dxdzy 



d<p M /dz 



(d 2 <p M /dydz) 



d(j) M /dz 



(93) 



Now, our eventual goal is to look at shifts of two different points: namely, a shift from s\ to 
s[ and a shift from s 2 to s' 2 . As we recall, we were assuming that si is located at (0, 0, 0, — r) 
and s 2 is located at (0, 0, 0, +r). Our previous calculations were assuming that z = +r and, 
therefore, the results apply to s 2 . If we are to use z = — r for f(si) the signs of odd-order 
terms will be reversed while the signs of even-order terms will stay the same. Since the above 
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expression does not have 0(r 2 ), the leading order correction is of 0(r 3 ) and, therefore, comes 
with the reversed sign: 



MMx)) - MM)) 



t 3 (d 2 4> M /dxdz)' 



d(f) M /dz 



r 3 (d 2 (f) M /dydz) 



d(j) M /dz 



(94) 



On the other hand, since s 2 is located at z = +r, the expression for s 2 does not change sign. 
Thus, 



Mf(s 2 ))~MM)) 



t 3 (d 2 (j) M /dxdzy 



d(f) M /dz 



t 3 (d 2 (j) M /dydz) 
f(s 2 ) 2 d(f) M /dz 



(95) 



If we now subtract Equation [9U from Equation [951 an d move /(si) and f(s 2 ) to the right 
hand side while keeping f(s' 1 ) and f(s 2 ) at the left, we obtain 



Mf(s' 2 )) - Mf(4)) = MM)) - MM))+ 



+ 



r 3 (d 2 (f) M /dxdzY 



+ 



2 d(j) M /dz 
t 3 (d 2 4> M /dxdz) 2 



+ 



t 3 (d 2 4> M /dydz) 



/(,,) 2 d(f) M /dz 

t 3 (d 2 4> M /dydz) 2 



(96) 



d(f) M /dz 



f(si) + 2 



/(«0 



d(j) M /dz 

Now, the difference between the derivatives (of any order) evaluated at Si and s 2 is of 
0(r). Since the derivatives are being multiplied by r 3 , this 0(r) difference will become 
0(r) x 0(r 3 ) = 0(r 4 ) which we can ignore. Therefore, we can replace the derivatives "at 
/(si)" and "at f(s 2 )" with derivatives "at the origin". This will allow us to combine these 
for terms into two terms: 



MM)) - MMi)) = MM)) - MM)Y 



+ r 



(d 2 4>M/dxdzf 



d<p M /dz 



+ T 



3 (d 2 M /dydz) 



(97) 



d(j) M /dz 

Now, the first two terms on the right hand side have yet more higher order derivatives hidden 
in them. Namely, 



MM))-MM)) = *t- 



dz 



+ 



4r 3 <9 3 c 



'M 



3 dz 3 



(98) 



Thus, the final expression becomes 



Mf(4))-MMi))=2r^ 



+ 



4r 3 d 3 (j). 



M 



3 dz 3 



+ 



3 (d 2 cj) M /dxdz) 2 3 (d 2 (p M /dydz) 

+ T — — r T 



(99) 



the latter is 



d(f>M I dz d(pM I dz 

Now in order to get Lagrangian we need to square the above expression. Up to 0(r 4 ) terms, 
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J s = {MM)) ~ MMiW = 4r 2 



dz 



+ 



(100) 



+ 



16r 4 <9 3 0. 



M 



dz 3 



o dz 



o V oxdz 



o V oydz 



d 2 



So far we have found a way of maximizing (0x(/( s 2)) — < / > A / i(/( s i))) 2 f° r a /^ced Alexandrov 
set. The next step in the prescription outlined in the previous section is to look at the 
different possible Alexandrov sets, do the "maximization" within each one, and then select 
the one Alexandrov set that would minimize the "maximum". Now, the original Alexandrov 
set was selected in such a way that the gradient of 4>m is parallel to z-axis at the origin. 
This implies that, upon "very small" coordinate rotation, the first term on the right hand 
side varies quadratically. On the other hand, the last three terms on the right hand side 
vary linearly. In other words, the equation behaves as 



Js = Jo + ar 2 9 2 + br 4 9 



(101) 



where 9 is the angle of rotation. In order to minimize the above we have to find the place 
where derivative is zero with respect to 9: 







This implies that 



" I 2r\2 

d9 {aT6 



9 



br 4 



br 4 
2aT 2 



2ar 2 9 + br 4 



bf_ 
2a 



By substituting it back into Equation I101[ we obtain 



Js 



b 2 T 6 b 2 T 6 

Ja + — + 



3b 2 r 6 
4a 



4a 



2a 



(102) 



(103) 



(104) 



Since we are computing Lagrangian density only up to 0(r 4 ), this means that we are free to 
ignore the effect of "minimization" part. In other words, we don't have to rotate Alexandrov 
set from its original position, and simply copy the result of "maximization" (Equation II Q0[) 
for J as our answer for C. Since we are not going to need the Alexandrov set any more, we 
can think of it as "pointwise" Lagrangian where the "point" in which Lagrangian is evaluated 
is what used to be the origin. Thus, we will rewrite Equation 11001 while dropping " at the 
origin" signs: 



= 4r^ 



16r 4 d 3 (f) M d(f) M , ^ 4 /<9 2 <; 



+ 2r 4 



V 



+ 2t 



( 



d 2 



dz J 3 dz 3 dz \ dxdz ) V dydz 

Now, the above expression is true only in the coordinate system where 



(105) 



dx 



dy 







(106) 



We can, however, use the fact that x- and y- derivatives are zero in our particular frame in 
order to replace ^-derivatives with covariant expressions: 



d z \ 



diddiX 



d;X 



^ , dl\ 



d. 



%djd k X 



(dKPdKf)) 3 / 2 



(107) 
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Basically, we have replaced ^-derivative with a contraction to gradient of 4>m since the latter 
is supposed to be parallel with z-axis. If the amplitude of gradient of 4>m is something other 
than unity, this would pick unwanted scalar factors. In order to "get rid" of them, we divide 
the contraction by the appropriate power of the amplitude of gradient of (p. 

Apart from "getting rid" of z- derivative, we have to also "get rid" of x and y. In order 
to do it, we add and subtract (d z d z (f>M) 2 '- 

16r 4 

C s = AT 2 {d z <p M f + —dPjMd^u + 2r 4 d k d z( f )M d k d z( j )M - 2r\d z d z <p M f (108) 

This reduces the task of "getting rid" of x and y to the task of "getting rid" of z. We then 
use Equation 11071 to do the latter, obtaining 

C s = At d k (j) M d k (j) M + — J J - + 

3 Oi<PmOi<Pm 

di<f) M di(fi M \ d k <j) M d k <j) M J 

Finally, we would like to rewrite it in Lorentz covariant way. We recall from previous section 
that we originally chose the frame in which 

do<p = (110) 

This seem to suggest that we can blindly replace Latin indexes with Greek ones. Of course, 
we still have non-zero values of second derivatives involving time: 

d Q d^0 (111) 

However, by quick inspection of Equation 11091 we see that any second derivative that involves 
any given index is always coupled to the first derivative with respect to that index. Thus, 
if we were to replace the non-covariant indexes with covariant ones, the "unwanted" d p do<f) 
terms will be multiplied by d^cf) and, therefore, sent to zero. The only possible concern 
we might still have is that, due to the perturbations, the axis of Alexandrov set has been 
rotated which would result in non-zero value of time derivative. However, in the argument 
we have made in Equations 110X1 and 11041 the effect of such rotation is of 0(r 6 ) and, therefore, 
negligible as far as the precision of the calculations at hand is concerned. Thus, at the 
expense of extra 0(r 6 ) error, we will agree not to rotate the axis of Alexandrov set which 
will, in turn, allow us to use <9 O = 0. Thus, we will go ahead and rewrite Equation 11091 in a 
covariant form: 

3 o a (f) M d a (f) M 



d a (p M d (7 (f) M V d a (p M d (7 (f) M ) 

where the sign change in the first term on the right hand side is due to (+1, —1, —1, — 1) 
convention. 
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Now we recall that the calculation above was made under assumption that the grandient 
of 4>m happened to be spacelike at a point we are interested in. After all, this is what allowed 
us to align the gradient of with z- axis (as opposed to t-axis). Let us now consider the case 
when gradient is timelike. Let us select Alexandrov set in such a way that — f^(p) 

is parallel to (<9 M 0)(O), and let us select coordinate system in such a way that t-axis passes 
through f(p) and f(q) with origin at the middle. Thus, 

F(P) = , /"(?) = ^ (H3) 



dx 



dy 



dz 



(114) 



In this case, linear terms no longer contribute to Lagrangian generator. Now, in the previous 
calculation the presence of linear terms was the reason why we had to select si and s 2 in 
the nearly-opposite directions from the origin. Therefore in the present situation this is no 
longer the case. Now, one consequence of the fact that s± and s 2 were selected in the opposite 
directions from the origin was lack of contribution from second order derivative terms. Thus, 
the next order correction was coming from third derivatives. In our present situation, since 
Si and s 2 are no longer opposite, the second order terms begin to contribute. 

Apart from that, there is yet another difference. In the previous calculation, the 0(r 3 ) 
correction to lead to 0(t 3 ) x 0(t) = 0(r 4 ) correction to the Lagrangian. This time, due 
to lack of linear terms, 0(r) is being replaced by 0(t 2 ). Thus, the contribution of 0(r 3 ) 
correction to is 0(r 3 ) x 0(r 2 ) = 0(r 5 ). Since we are only computing Lagrangian up to 
0(r 4 ), we can neglect 0(r 5 ) effects on Lagrangian and, therefore, we can likewise neglect 
0(t 3 ) correction to 0. On the other hand, 0(t 2 ) correction to leads to 0(r 2 ) x 0(r 2 ) = 0(r 4 ) 
contribution, which we can not neglect. Thus, we will assume that is quadratic. 

Now, in light of lack of spacelike gradient of 0, we don't have any "preferred" space 
coordinate the way we had before; the only "preferred" coordinate we have specified so far 
is t. Therefore, we are free to perform spacelike rotations as long as t stays fixed. We will, 
therefore, rotate the spacelike coordinates in such a way that 3x3 matrix = d^djcf) is 
diagonalized, and its eigenvalues are Ai, A2 and A3. Thus, 



<9 2 



dx 2 



Ai, 



<9 2 



dy 2 



A 2 



d 2 (j) 



dz 2 



A 3 (115) 



The value of at an arbitrary point on the equator is 



«») = m + *£ + *£ + ¥- me) 

Therefore, the "maximum" and "minimum" of 0(s) are given by 

t 2 r 2 
0max = 0o + y max(Ai, A 2 , A 3 ) , min = O + y max(Ai, A 2 , A3) (117) 

This means that pre-Lagrangian is given by 

Js = — (max(Ai, A 2 , A 3 ) - min(Ai, A 2 , A3)) 2 (118) 



20 



Now we have to see whether or not we have to rotate the axis of Alexandrov set by a small 
amount in order to minimize J . Geometrically, it is easy to see that if a function is symmetric 
around the origin, then the rotation away from the symmetric state would only increase the 
difference between maximum and minimum. Since our goal is to minimize, the only instance 
where we would need to rotate is when the symmetry is already broken by odd-order terms. 
Now we already know that we don't have linear terms. Therefore, the only justification for 
rotation would be third-order terms. But we have already established that 0(r 3 ) terms in 
have 0(r 5 ) effect on Lagrangian which we ignore. Therefore, the effects of minimization of 
J s are, likewise, of 0(r 5 ) and are likewise ignored. Thus, we simply copy the expression we 
had for J into C without further modifications: 

C = — (max(Ai, A 2 , A 3 ) - min(Ai, A 2 , A 3 )) 2 (119) 

One should note that the above expression is not covariant. After all, we have only three 
eigenvalues rather than four, since we are referring to only spacelike components of second 
derivative, didjcj). We can obtain a covariant expression by replacing t axis with d^(j)/\d^(j)\. 
We thus define the following tensor: 

In this case D^ v will have four eigenvalues; but one of them will be zero. Thus, we would have 
to take a maximum and minimum out of {Ai, A 2 , A3, A4} \ {0}. In order to make ourselves 
completely safe for the situations in which the "zero" eigenvalue will deviate from 0, we will 
instead do {Ai, A 2 , A 3 , A 4 } \ (— e, e). Thus, we have 

C s = ^(maxtfAx, A 2 , A 3 , A 4 } \ (-e, e)) - min({A 1 , A 2 , A 3 , A 4 } \ (-e, e))) 2 (121) 
Thus, if we bring together the case of spacelike and timelike gradient, we obtain 

C s = (-1 + sgn(d^d^)) \lr 2 d^ M d^ M - 



2 8r 4 d^4> M d v 4) M d p 4) M d lx d v d p 4) M 



d a (p M da(pM \ d a (j) M d (T (f) M ) 



d <J (pMd ( j<pM 

'+ (122) 



r 2 

+ — (1 + S ( ? n(9^0))(max({A 1 , A 2 , A 3 , A 4 } \ (-e, e)) - min({Ai, A 2 , A 3 , A 4 } \ (-e, e))) 2 

So far we have found an expression for C s . Let us now discuss C t . In this case the calculation 
will be considerably simpler since the two points we are looking at will simply be p and q 
so we would no longer need to consider their displacements. The pre-Lagrangian simply 
becomes 



Now we need to rotate Alexandrov set in a way that the above expression is minimized. 
Let us start from timelike case. In case of (pM being linear, we have to align the axis of 



4r 3 <9 3 0\2 2 /<90\2 16r 4 <90(9 3 
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Alexandrov set with the gradient of the field, as we have done in the "timelike" part of the 
previous section. If, on the other hand, <$m is non-linear, we can repeating the argument 
similar to Equations 11011 and 11041 to show that the small Lorentz transformations would only 
result in 0(r 6 ) corrections which we don't care about. Therefore, we will identify the above 
expression for J with a Lagrangian C: 

r (A(\ A( \V (o d( t> 4r 3 0Vy „ 2{d<py 16r 4 dc/> d 3 cf> 

= Wq) - m) = \ 2T dt o + — w) = 4r y-m) + —mw (124) 

Finally, by using the fact that space derivatives of </> are zero, we can generalize the above 
to Lorentz covariant expression: 

A= ^ w+ y^y (125) 

Now let us consider the case where the gradient is spacelike. In this case, we can pick an 
Alexandrov set in such a way that (fi(q) = (p{p) exactly holds, setting C t to exact zero: 

<9 M 0«9 M < =>■ A = (126) 

This should be contrasted with the fact that C s is not an exact zero in case of timelike 
gradient (see Equation 11211) . The reason for this is that in case of C s we are looking at "a 
lot of points (namely all of the points across the equator of Alexandrov set) whereas in case 
of C t we are looking at only two points (namely p and q). In case of two points, we can 
manually move them in such a way as to get an exact match. On the other hand, in case 
of several different points, the "rigidity" of the shape prevents us from doing it: manually 
adjusting some of them would compromise adjustment of the others. Thus, in case of several 
points the best we can do is to "trust" some specified order of derivative, which would lead 
to higher order deviations. Anyway, Equations 11251 and 11261 can be summarized as 

A = (1 + sgn^M + X ^Xtf^ ) <127) 

Finally, by using 

C = C t -C s (128) 

we obtain 



C s = 4r 2 d»d u 6 + — - n n v-wvy i x + 



+ (-1 + sgn{d p <pd^)) — — — — g p - + (129) 

L 3 d a (p M d a (p M 

i dv-(j)Md l '(j)MdLid p (f) M d u dp(f) M 4 fd fl (j) M d u (j) M dpd u (f) M 
+ T o — — ; r 



d a (f) M d a (j) M V d a (p M d a (j) M 



r 1 



_ _ (1 + s ^(^^))(max({A 1 , A 2 , A 3 , A 4 } \ (-e, e)) - min({A 1 , A 2 , A 3 , A 4 } \ (-e, e))) 2 

One should note that the only 0(r 2 ) term is 9 M 0(9 M 0, and it comes without sgn. This is similar 
to the result of previous section when sgn canceled out after we performed a subtraction 
C = C t — C s - However, in our present situation, sgn still contributes to higher order terms, 
which we were ignoring in the previous section. 
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5. First order Lagrangian density for electromagnetic field 



Let us now describe electromagnetic Lagrangian. Similarly to what happened with scalar 
field, we will need two Lagrangian generators: a "spacelike" and a "timelike" ones. Naturally, 
we will call the former "magnetic" and the latter "electric". And, again, similarly to scalar 
case, magnetic and electric Lagrangians will each be Lorentz covariant on their own, but 
we will see some unwanted terms. These terms will be perfectly covariant and the only 
"problem" with them is a simple fact that they were never observed in the lab. Fortunately, 
they will end up canceling out once the two Lagrangians are added. 

In scalar case, we have identified Lagrangian generator with (0(s 2 ) — 0(si)) 2 . In the 
current situation, we will define it to be a four-point function, 

/Cs(si, s 2 , s 3 , s 4 ) = (a(si, s 2 ) + a(s 2 , S3) + a(s 3 , s 4 ) + a(s 4 , si)) 2 (130) 

Now, if there is an embedding /: S — > M, then this Lagrangian generator becomes 

£s(si, s 2 , s 3 , s 4 ) = (a 5 (si, s 2 ) + a 5 (s 2 , s 3 ) + a s (s 3 , s 4 ) + a 5 (s 4 , s x )) 2 = 

= (a M (f(si), f(s 2 )) + a M (f(s 2 ), f(s 3 )) + a M (f(s 3 ), f(s 4 )) + a M (f(s 4 ), /(si))) 2 = 

0rf( s 2) rf{s3) rf{sA) rf(si) . 2 

/ A^dx^ + / A^dx^ + / A^dx** J = (131) 
/(Sl) ■//(«) ^/(«3) ^/(«4) 7 

= ( / =( f B- da) 2 = Flux 2 

V Aoop(/(si),/( S2 ),/(s3),/(s4) 7 V isqu a rc(f(s 1 ,f(s 2 ),f( S3 ),f(s4))) 7 

where the "magnetic field" is taken in the reference frame in which t-axis is identified with 
the line passing through f{p) and f(q) and, as usual, we were able to use our knowledge of 
coordinate-based calculus (such as Stoke's theorem) because we were dealing with the space 
in which f(sk) are living in, as opposed to the space where Sk do. 

Now, for any given Alexandrov set, we need to evaluate the maximum of above expres- 
sion. For our convenience, let us denote /(si), /(s 2 ), /(s 3 ) and /(s 4 ) by A, B, C and D, 
respectively (in other words the lower-case letters, such as s, denote the elements of S, while 
the upper-case elements, such as A, denote the elements of M). Thus, we are interested in 
positioning points A, B, C and D on the surface of equator of Alexandrov set in such a way 
that the flux through the contour ABCD is maximized. Now, it is easy to see that the flux 
through the rectangle embedded in the sphere is less than or equal to the flux through the 
square embedded inside the circle, where we assume that the radius of the circle is the same 
as the one of the sphere (which, in our case, is r). Now, in case of embedding of rectangle 
inside the circle, 

Area(ABCD) = Area(AOB) + Aiea(BOC) + Area(COD) + Area(DOA) (132) 
Furthermore, if we let E be an intermediate point on the segment AB, then 

Area^OB) = Area(AOE) + Arez(BOE) = AE * 0E + BE * 0E (133) 
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If we denote the angle AOB by 9, then 



6 9 
AE = BE = r sin - , OE = r cos - (134) 



By substituting this into Equation I133[ we obtain 

e e t 2 



Area(AO-B) = r 2 sin ^ cos — = — sin 9 (135) 



Therefore, the area of the rectangle is 



r 2 



Area(ABCD) = — (sin AOB + sin BOC + sin COD + sin DO A) (136) 

Now, let us find the derivative of this area with respect to displacement of point B. In other 
words, we will replace B with B', and assume that the angle BOB' is equal to e. Then 

Area(AB'CD) = — (sin AOB' + sinB'OC + sin COD + sin DO A) = 

T 2 

= —(sin{A0B + e) + sin(B0C - e) + sin COD + sin DO A) = (137) 

= — (sin AOB + sin BOC + sin COD + sin DO A + e(cos AOB - cos BOC) + 0(e 2 )) 

In light of the fact that e can be both positive or negative, we can always pick a particular 
sign of e that would increase the flux through the contour as long as angles AOB and -BOC 
are not equal to each other. This implies that at the maximum AOB = BOC. By similar 
argument, we also know that BOC = COD, COD = DO A and DO A = AOB. This means 
that every single angle is tt/2. In such case, the area becomes 

T 2 /7T 77 77 77\ 2 

max(Area) = — sin — h sin — h sin — h sin — = 2r (138) 
v ; 2 V 2 2 2 2/ 

This implies that the magnetic flux through the contour is 2r 2 \B\ 2 . Now that we have done 
the maximization of the flux for a given Alexandrov set, let us do the minimization over the 
collection of different Alexandrov sets. This amounts to the selection of reference frame in 
which \B\ 2 is minimized. We claim that such frame coincides with the frame in with E and 
B are parallel. This argument will consist of three parts: 

a) Show that the frame where E and B are parallel exists to begin with 

b) Show that in such frame the value of \B\ 2 is minimized 

c) Find a covariant expression in arbitrary frame that coincides with \B\ 2 in the frame 
specified above 

Let us start with the proof of the existence of that frame. Let us start with arbitrary E 
and B and then rotate the frame in such a way that they end up being parallel. Without loss 
of generality, we can assume that .E-field is parallel to x-axis, while 5-field lies in x?/-plane: 

E=(E x ,0,0) , B = (B x ,B y ,0) (139) 
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When we make a Lorentz boost in ^-direction, these fields transform according to 

E' = ( 1 (E x -vB y ) n vB x ,0) (140) 

B' = (iB xll (B y -vE x ),Q) (141) 
In order for E' and B' to be parallel, we need to have 



or, in other words, 

vB x By- vE x 



E x — vB y B x 



(142) 



(143) 



This can be rewritten as a quadratic equation, 

(1 + v 2 )E x B y - v(E 2 x + Bl + flj) = (144) 

This equation solves to 



El + BI + B 2 - J (El + Bl + m)* - AEIBI 



vi = Vb (M5) 



EI + BI + B 2 + J (Ei + Bi + Bi)i - AEIBI 



V2 = "~^r-B (146) 

lJii x r5 y 

We now have to show that at least one of these solutions is physical. First of all, we note 
that neither of these two solutions have imaginary part. After all, 

(E x - By) 2 > (147) 

which implies that 

2E x B y < El + Bl (148) 

and, therefore, 

(E 2 X + Bl + B y ) 2 - 4ElB 2 y > (E 2 X + B 2 y ) - AE 2 x B 2 y > (149) 

This, however, is not enough. We also have to show that at least one of these two velocities 
(v\ or t> 2 ) is between —1 and 1. It turns out that v 2 does not satisfy this condition. After 
all, 

E 2 X + B 2 X + B 2 + J (Ei + Bi + Bi)i - AEIBI E 2 + B 2 

v 2 = > > 1 (150) 

2E x B y 2E x B y 

where Equation 11481 was used at the last step. Since v 2 does not fall between —1 and 1, we 
must show that Vi does. In order to do it, we first note that 



Va 2 -2ab + b 2 = a-b (151) 
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and, therefore 



Va 2 — b 2 > a — b 



(152) 



This, in particular, implies that 



^{El + Bl + B 2 ) 2 - AEIBI > El + B 2 X + B 2 - 2E X B ) 



y 



(153) 



Therefore, 



El + Bl + Bl - J(El + Bl + B 2 ) 2 - AEIBI 



< 



2E x B y 

El + Bl + Bl - El - Bl - Bl + 2E X B 



1 



(154) 



2E x B y 

Apart from showing that v\ < 1 we also have to show that v\ > —1. This part can be 
done by simply showing that V\ > 0. Now, we have already shown that the expression 
under square root is positive (see Equation I149p . Furthermore, we trivially know that that 
expression is less than [El + Bl + By) 2 . This, together with the fact that it is positive, 
implies that its absolute value is smaller than {El + Bl + B 2 ) 2 , as well. Thus, the value 
of square root is smaller than El + Bl + B 2 . Since square root is being subtracted from 
El + Bl + B 2 , this shows that the result of subtraction is positive, thus implying that V\ > 0. 
This, together with the previously shown result V\ < 1 implies that \v\\ < 1, which implies 
that Vi is physical. 

Technically, we have to show that the strict inequality \v\\ < 1 holds. This part is easy. 
By repeating the above arguments while inserting the assumptions E x ^ and B y ^ we 
would, in fact, obtain the strict inequality. If, on the other hand, we have E x = 0, then 
we have E = 0, which means that E and B are already "parallel" and no boost is needed 
to begin with. If, instead, B y = then, again they are parallel: both point along x-axis. 
Thus, again, no further boost is needed. In both of these cases, however, the Equation 11451 
would produce 0/0. Thus, they would be "parallel" strictly due to qualitative argument we 
have just presented. If, on the other hand, both of the numbers are non-zero then the strict 
inequality will begin to hold and, therefore, we would be able to find v i < 1 that would make 
them parallel. 

Now that we have proven the existence of the frame where E and B are parallel, let us 
show that in this frame the value of \B\ 2 is minimized (see part "b" in the last outline). Let 
us start from the frame where E and B are parallel and see how they transform. We can 
assume that their common direction is x. Without loss of generality, we can assume that we 
are doing boost along z-axis, which leads to the transformation 



&= (yE x ,yoB x ,0) 



(155) 



£' = (7^,-7^,0) 



(156) 



This implies that in the new frame 



B'\ = yy/Bl + v 2 E 2 x > yB x > B x 



(157) 
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Since in the original frame we have 

\B\ = B x (158) 

we have shown that 

\B'\ > \B\ (159) 

Thus, in the frame where E and B are parallel the flux is, in fact, minimized. 

Let us now arrive at a covariant expression for the above. On first glance, it seems like 
it is "impossible" due to |I?| 2 not being covariant. However, in light of the fact that the 
above computation was done in one specific frame (namely, the one where E and B are 
parallel), it is logically possible that the "minimal flux" corresponds to covariant expression 

— * icy — * — * 

in all frames, but that expression "matches" \B\ only in the frame where E and B are 

— * — * 

parallel. Let us, therefore, find a covariant expressions which \E\ and \B\ match in the above 
said frame. In order to agree with convention used in the rest of the paper, let us change 
coordinates and assume that the common direction E and B point to is z rather than x. 
Thus, the only non-zero components of F^ u are F 3 and Fi 2 . Therefore, in that frame, 

\E\\B\ = F 03 F 12 = L M5 F^F^ (160) 

o 

\B\ 2 - \E\ 2 = F m F m + F 12 F 12 = F afi F afi (161) 
The linear combinations of the above two equations lead us to 

{\B\ - \E\) 2 = F a ?F a p - le^F^F^ (162) 
(\B\ + \E\) 2 = F a/3 F a/3 + -^F^F^ (163) 

— * — * 

If we assume that \B\ > \E\, then we obtain 



B\ | E\ \j F°PF aP - l -e aM& F^F^ (164) 



\B\ + \E\ = F a PF a/3 + ^e aPlS F^F^ (165) 
which implies that 

\B\ = UsJf^F^ + ^eofrsF^Fi* + tJf^F^ - \e a p l5 F^FA (166) 



\E\ = \ [\jF^F a p + ^eafrsF^F-rS - ^F^F aP - \e aPl5 F^F^ (167) 

— * — * 

If, on the other hand, we assume that \B\ < \E\, then we obtain 

|^| - \B\ = S jF^F aP -- A e aPl5 F^F^ (168) 
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\B\ : \B\ = \;F^F a p + ^e aMS F^F^ (169) 



which implies that 



\B\ = ~ f J F«PF a p + ^e aM5 F^F^ - J F^F aP - \e aP ^F^F^ j ( I 70 ) 



\E\ = \ U/F^F^ + -e^F^F?* + yj F a @F a p - \e aMS F^FA (171) 

Now in order to neatly write down these two cases in one equation, we introduce "signum" 
function defined as 

r -1 If x < 

sgn(x) = < if x = (172) 
{ +1 if x > 

Thus, the above equations would have to use sgn{\B\ — \E\. But, of course, we do not want 
any references to B and E on the right hand side. What comes to our rescue is the fact that 

F^F^ = \B\ 2 - \E\ 2 (173) 

This means that 

sgn(\B\ - \E\) = sgn(F^F^) (174) 

We can, therefore, use the "covariant" expression sgn(F^ u F^ in order to summarize the 
"case by case" situation we are dealing with: 



\B\ = \ ( y F a PF aj3 + X -e a ^F^F^ + (sgn(F^F^))J F^F a0 - ^e aM5 F^F^ \ ( 170) 



\E\ = i {^F^F^ + - A e aM5 F^F^ - (sgn(F^F^))^ F^F aP - -e.^F^F^ ) (176) 
Squaring the above two equations implies 



|£|2 = 1 | paftp^ + ( S gn(F^F^))J(F^F a tf - L^FWF*)* ] (177) 



\E\ 2 = l - (f^F q/3 - (sgn(F^F^))^/(F«PF a py - l( 6a ^ s F^F^j (178) 

Now, as we recall, we were selecting the four points lying in the equator in our definition for 
Lagrangian generator. This resulted in \B\ 2 being the only contributing term. As we see from 
the equation 11771 the absence of E does not stop us from still obtaining the relativistically 
covariant result. The only "problem" we encounter is the presence of e^sF^ F^ 5 , which is 
not seen in the lab. This term, however, is still covariant! This is, in fact, what one would 
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expect. After all, our construction did not appeal to any coordinate system; thus, the result 
has to be covariant no matter what. 

Now, in order to get rid of e a ^F a ^ 'F 7<5 we introduce a separate Lagrangian which has 
\E\ 2 alone. As quick inspection of above equation shows, this will introduce e ol i3 1 $F a P F~' 6 with 
an opposite sign, allowing for cancellation. Now, the way we introduce the |i?| 2 -Lagrangian 
is by again considering the four-point sets, with the same four-point Lagrangian generator, 
while modifying the criteria by which we select these sets. In "magnetic" case we were 
constraining all four points to the equator. In the "electric" case, on the other hand, 

6 Corrections to magnetic Lagrangian 

Once again, we are considering a contour based on si, s 2 , S3 and s 4 , where it is assumed 
that all four of these points are lying on the "equator" of a(p,q), where it is assumed that 
the point a; is a "midpoint" of a geodesic connecting p and q: 

r(p, r) = r(r, q) = iM = r ■ p -<* Sl , 2 , 3 , 4 ^* q (179) 

Since we are computing pre-Lagrangian as opposed to actual Lagrangian, we are assuming 
that p and q are "fixed". Our goal is to select s±, s 2 , S3 and s 4 in such a way that the value 
of 

K B = a(si, s 2 ) + a(s 2 , s 3 ) + a(s 3 , s 4 ) + a(s 4 , si) (180) 

is maximized with respect to the above constraints. Now, from section 6 we already have an 
"approximate" idea of where si, s 2 , S3 and s 4 are positioned. Our present task is to compute 
deviations from that configuration due to the higher order derivatives. In other words, we 
would like to set 

s[ — si + Ssx ; s' 2 = s 2 + 5s 2 ; s' 3 = s 3 + 5s 3 ; s' 4 = s 4 + 5s 4 (181) 

and then maximize it with respect to 8si, <5s 2 , <5s 3 and <5s 4 by using the usual derivative 
techniques. 

For the purposes of this paper, we are only interested in lowest order correction. Let 
us, therefore, take a qualitative look and see which corrections we should take into account 
and which we should ignore. The area of the square loop is 0(t 2 ) which means that the 
flux of magnetic field through that loop is 0(r 2 ) as well. The displacement of our "points" 
has to be "small" with respect to the "size" of the Alexandrov set, 2r. Therefore, their 
displacement is likewise of 0(t 2 ). Thus, the "correction" to the flux of the magnetic field is 
of 0(t 4 ). Now, the Lagranigan generator is equal to square of the magnetic flux. Therefore, 
the "original" Lagrangian generator is (0(r 2 )) 2 = 0(r 4 ). On the other hand, the "corrected" 
flux is (0(r 2 ) + 0(r 4 )) 2 = 0(r 4 ) + 0(r 6 ) + 0(r 8 ). We will, therefore, "throw away" 0(r 8 ) 
term and "look for" the 0(r 6 ) corrections. In other words, we are seeking expression of the 
form 0(r 4 ) + 0(r 6 ). 

Once again, we will select a coordinate system so that its "origin" coincides with point 
r, and t axis passes through p and q. Thus, x, y and z axes all cross the equator. In light of 
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the fact that we assume that spacetime is flat, and also since our constraint p -<* Si, 2,3,4 ~<* Q 
is highly restrictive, we know that the t coordinate of these four points is exactly zero. While 
the points are going to undergo small "shifts", these "shifts" will be constrained to a sphere 
of radius r on xyz-plane. Now, since "electric field" is identified with F k this immediately 
implies that the effect of the latter will continue to be exactly zero. This allows us to focus 
exclusively on the magnetic field. Now, we will identify z-axis with the direction of the 
magnetic field at the origin: 

B(0) = Bz (182) 

At the same time, away from the origin, the values of B x and B y can become non-zero. 
However, in light of the fact that we are only concerned about the interior of the Alexandrov 
set, their values are of O(r): 

B x = 0(r) ; B y = 0(r) (183) 

Let us now look at the contributions of linear and quadratic terms. From antisymmetry it is 
clear, if we attempt to "integrate" B x and B y then the linear terms will drop out unless we 
shift the contour of integration slightly. If we do shift the contour, then the effect of the first 
order terms will be multiplied by the modification of area due to the shift. Since the area, 
itself, is of 0(t 2 ), its modification is of 0(t 3 ). Furthermore, since the size of Alexandrov set 
is of 0(r), the linear effect on its own is of 0(r). Thus, together, the linear effect becomes of 
0(t) x 0(t 3 ) = 0(r 4 ). The quadratic term, on the other hand, is of 0(r 2 ); thus it "starts out" 
smaller than linear term. At the same time, however, the quadratic term is no longer an odd 
function, thus we do not need to shift the contour in order for its integral to be non-zero. 
Thus, we are multiplying the quadratic term by the unperturbed area 0(r 2 ) (as opposed to 
0(t 3 ) perturbation we were multiplying first order term by). Therefore, the "integral" for 
second order term will end up being 0(r 4 ), just like it was for the first order! Thus, the 
"total" integral is, likewise, 0(r 4 ): 

J dAB x = 0(r 4 ) ; J dA B y = 0(r 4 ) (184) 

Now, the above integrals are intentionally taken over "scalar" area element dA, and the 
integrands are "real valued" functions B x and B y . Let us now replace dA with ds and B x 
and By with B x x and B y y, respectively. If the contour was lying on rry-plane that integral 
would clearly be zero. Therefore, the value of that integral can be assumed to be proportional 
to the angle by which s[, s' 2 , s' 3 and s' 4 have "shifted": 

J ds- (B x x + B y y) = ®(f dAB a .)o(Zs' 1 Osi) (185) 

In the above expression B x is not "better" than B y , nor is Zs[0si any more "important" than 
Zs 2 0s 2 either. We simply assumed that the "order of magnitude" won't change upon these 
replacements, which is why we "picked" arbitrary "examples" of the "orders of magnitude" 
we are looking for. Now, since points are only allowed to "shift" by "small" amounts, the 
angle by which the contour tilts is also of 0(t): 

Z Sl 0s[ = 0(t) ; Zs 2 0s 2 = 0(t) ; Zs 3 0s' 3 = 0(r) ; Zs 4 0s' 4 = 0(r) (186) 
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This means that the "projection" of B x x + B y y onto a contour defined by s[, s' 2 , s' 3 and s' 4 
isof0(r 5 ): 

J ds- (B x x + B y y) = 0(r 5 ) (187) 

Since our calculation is only up to 0(r 4 ), this means that we can safely "throw away" B x 
and By and assume that the only non-zero component of B is B z : 

B x (t,x,y,z) = B y (t,x,y,z) = (188) 

In light of this, we will identify B with B z which will allow us to "drop" index z: 

B(t,x,y,z) = B z (t,x,y,z) (189) 
which is not to be confused with a "vector" 

B(t,x,y,z) = zB(t,x,y,z) (190) 

However, as we will soon see, x and y derivatives of B z do lead to non-negligible contribution 
and, therefore, we do not ignore the latter: 

dB z , dB z dB z , . . 

aT^T^ ^ < 191 > 

At the same time, since contribution of each of the above is 0(r 6 ), their "products" are, in 
fact, negligible. This allows us to take each of the above derivatives "one at a time" and 
any given time assume that the derivative we are looking at is non-zero while the other two 
derivatives are zero. 

Let us start with dB z /dz. It is easy to see that if points are trying to "maximize" the 
flux, they will shift "upward" if dB z /dz is positive and "downward" if dB z /dz is negative. 
Furthermore, since we are not neglecting dB z /dz, we are neglecting dB z /dx and dB z /dy 
(and we are planning to count these separately, later on). Thus, we have a symmetry around 
z axis, which implies that the points "shift" vertically by the same amount. Let us assume 
that the "common" amount by which the points shift is h. Again, because of the symmetry 
around z-axis, we can assume that the points will not shift horizontally (the only exception 
is a slight shift towards the z- axis in order make sure that the points "stay" on the sphere). 
Thus, they continue to form the square; but the length of the sides of the square changes 
from a to a' where 

a 2 = 2r 2 ; a' 2 = 2(r 2 - h 2 ) (192) 

Now the area of the "new" square is a' 2 . This means that the flux through that new square 
is B'a' 2 where 

B' = B(0) + h— +0(h 2 ) (193) 

OZ 

Now, since h is "small" compared to the size of the Alexandrov set, we know that 

h = 0(r 2 ) (194) 
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This means that 

0(h 2 ) = 0(r 4 ) 

Now, since the "flux" takes place through the area of 0(r 2 ), the effect on flux is 
SB = 0(h 2 ) = 0(r 4 ) =}► <5(flux) = 0(r 2 ) x 0(r 4 ) = 0(r 6 ) 



(195) 



(196) 



which makes it negligible. Thus, we have established that we can throw away the 0(/i 2 ) 
terms and just use linear expression for B: 



(197) 



Now, as we stated earlier, the "flux" through the shifted square is B'a' 2 . By substituting 
the expressions for B' and a' we obtain 



Flux = a' 2 B' 2 = 2(r 2 - h 2 ) { B(0) + h^- 

V ah 



(198) 



Now, our previous reason for "throwing away" 0(h 2 ) terms was the fact that we were planning 
to multiply them by the area which is 0(r 2 ). In the above expression, however, we have 
already performed a multiplication by the area. Thus, no further multiplication will be 
performed, which means that we will keep 0(h 2 ) terms, seeing that 0(h 2 ) = 0(r 4 ) and we do 
not neglect 0(r 4 ) terms. At the same time, since 0(r 4 ) = 0(h 2 ) is the highest order that we 
keep, we will throw away 0(/i 3 ) term. Thus, we obtain 



F1ux = 2(t 2 5(0) + t 2 /^- -h 2 B(0)) 

\ OZ / 



Thus, in order to maximize flux, we need to find a value of h such that 

d ( o „/„s o, dB 



-h z B 



(0)) 







This implies that 



f_d_B_ 
2B~dJ 



(199) 



(200) 



(201) 



The fact that h has the same sign as dB/dz implies that the points will be "shifted" in the 
direction of gradient of B. Indeed, this is what we intuitively expect would maximize the 
flux. If we now substitute the above value of h into the expression for flux, we obtain 



Flux = 2B t 2 + 



2 , r'fdB 



B\dz 



(202) 



As a result of the squaring, the increment of flux is positive, regardless of sign of dB/dz. 
This is indeed what we expect. Both positive and negative value of dB/dt provide extra 
opportunity for us to bring flux closer to what we "want" it to be. Thus, as long as we 
"want" to make it larger, both positive and negative values of dB/dz will do just that. 

The above calculation is made under the assumption that the only non-zero derivative is 
dB/dz. But, as we stated previously, both dB/dx and dB/dy do have non-negligible effect. 
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We simply decided to compute the effects of the derivatives separately. So let us now move 
on to finding an effect of dB/dx. Thus, we will now assume that dB/dz is equal to zero 
and allow both dB/dx and dB/dy to be non-zero (but, at the same time, B is still parallel 
to z- axis, as always). In light of the symmetry between x and y, we do not need to separate 
the effects of dB/dx and dB/dy. Thus, we can assume that both of these derivatives are 
non-zero, and while dB/dz is the only derivative that is zero. 

In light of the non-zero values dB/dx and dB/dy, it is possible that the flux will change 
as we are "rotating" the "square" (with our points sitting in the corner). While the shift 
"away from equator" had to be "small", the rotation on the xy-plane can be anything from 
to 2n. Thus, we would like to first make a "large" rotation of the square to "appropriate" 
position and only after that make a small "shift" away from that position. The rotation will 
be "rigid", whereas the "shift" will involve stretching and compression of the sides of the 
square; that is the reason why the latter can not be absorbed into the former. 

We noticed that the only two axes that we have defined ahead of time were t and z. In 
particular, t-axis coincides with the line passing through p and q, while z-axis coincides with 
the direction of B(0). Thus, we are still free to select the directions of x and y whatever way 
we like. For our convenience, we will choose x and y in such a way that they are parallel to 
the edges of the square. Thus, in the (t, x, y, z)-notation, the coordinates of the four points 
are given by 

Sl = (0, -r, -r, 0) , s 2 = (0, r, -r, 0) , s 3 = (0, r, r, 0) , s 4 = (0, -r, r, 0) (203) 

— * 

Thus, instead of rotating square, we will be "rotating" B: 

B'(t, x, y, z) = B(t, x cos 9 + y sin 9, —x sin 9 + y cos 9, z) (204) 



Our goal is to select 9 in such a way that the flux of "rotating" B' through the "fixed" square 
is maximized. Now, the flux through the square is given by 



Flux 



L dx LM 



B(0)+x 



dB 



+ 



x 2 d 2 B 



2 dx 7 



+ 



y 2 d 2 B 



2 dy 2 



dx 
d 2 B 



dB 
o dy 



o dxdy 



) + 0(r 5 ) 



(205) 



Clearly, the terms proportional to x, y and xy are odd and, therefore, integrate to zero. 
Thus, our integral becomes 



Flux = J dx J dy{B{Q) + 



x 2 d 2 B 



2 dx 2 



y 2 d 2 B 



J + 0(r 6 ) 



(206) 



where we have replaced 0(r 5 ) with 0(r 6 ) because the 0(r 5 ) terms are, likewise, odd and 
therefore also integrate to zero (but, of course, this point is simply aesthetic given that we 
ignore both 0(r 5 ) and 0(r 6 )). Now the above integral evaluates to 



Flux = 2r 2 5(0) + -r 4 ( 



d 2 B 



dx 2 



d 2 B 



dy' 



+ 0(r 6 ) 



(207) 
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The above is invariant under the rotation of B. However, the "very small" terms that we 
"neglected", regardless of how small they might be would still result in a "finite" effect when 
it comes to B. For example, if the "small" term has the form T 100 sin#, it would still "select" 
9 = it/2, despite the "smallness" of r 100 . The only thing that can "stay in the way" is a 
"larger" term, such as 9 80 . This, however, does not change the fact that the "largest" term, 
whatever that term might be, would in fact be a determining factor of 9. In the past that 
issue was avoided because the "largest" term happened to be 0(r 4 ). In the present situation, 
since things are symmetric up to 0(r 4 ), the higher order terms will "have a final say" on 
what 9 would be. 

The good news, however, is that the "final" effect on 9 will be "unimportant". After 
all, the only reason we are interested in 9 is that we are interested in flux. Now, from the 
above equation 9 will have an 0(r 6 ) effect on flux, which means we don't care about it. In 
our subsequent calculations, we will pretend we will know 9. However, our calculations, up 
to 0(r 4 ), will be identical regardless of the actual value of 9. Therefore, we don't have to 
actually "find out" what 9 is, which means that we don't have to look at 0(r 6 ) terms either. 
In fact, this pattern can be generalized to other situations. The only way "something" is 
"important" is when it produces 0(r 4 ) terms. If such is the case, then that "something" 
won't be influenced by 0(r 6 ) ones. On the other hand, if that "something" doesn't produce 
0(r 4 ) terms then it "begins" to be influenced by 0(r 6 ); but then the lack of 0(r 4 ) makes 
that "something" unimportant. Thus, 0(r 6 ) terms "either" have very small influence on 
something "important", or they have large influence on something "unimportant". In both 
cases they can be neglected, just for different reasons. 

To make long story short, we have answered the question regarding "rigid rotation" of 
the square. In particular, yes the square will be rotated, but it is not important to find 
out how; so we won't. Now it is time to consider the small oscillation from its position, 
whatever it might be. As was stated earlier, small oscillations can not be "absorbed" into 
rigid rotation since they involve stretching and shrinking of sides of the square. For that 
same reason it turns out that the "stretching and shrinking" we are about to consider will 
have the 0(r 4 ) effect that we are looking for, despite the fact that rigid rotation does not. 
Now, the displacement of each point is of 0(r 2 ) which means that its effect can't be larger 
than 0(r 4 ). Therefore, the "interaction" between different displacements will be of 0(r 6 ) and 
can be neglected. Thus, we will consider the displacement of each point separately and then 
take their superposition. 

As was stated earlier, the only axes that are rigidly fixed are t and z. On the other hand, 
the choice of x and y is up to our convenience. Let us, therefore, change our choice of xy 
coordinates and this time select them in such a way that the coordinate axes pass through 
the four points on the "rigid" square. Thus, in the (t,x,y,z) notation, 

si = (0, r, 0, 0) , s 2 = (0, 0, r, 0) , s 3 = (0, -r, 0, 0) , s A = (0, 0, -r, 0) (208) 

Now, we will consider the effect of "displacement" of s±, while we will leave other points 
fixed. Thus, the "displacement" takes place from si to s[, where 

si = (0, r, 0, 0) , s[ = (0, r cos 9, r sin 9, 0) (209) 
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For our convenience, we will denote r sin 9 by h: 



h = rsin9 (210) 

The integral over the "square" s' 1 S2SsS4 can be represented as a sum of the integral over a 
triangles ^s'^ and s 2 s 3 s 4 . Since the only "change" we are making is going from sx to s[, 
the triangle s 2 s 3 s 4 is left unchanged. Thus, we can exclusively focus on s[s2S 3 . Now, as we 
just said, we can assume that SiS 2 s 3 is a triangle rather than a circle. We now notice that 

s' r s 2 s 3 = (sis 2 s 3 U s 2 s 1 s[) \ s 4 sisi (211) 

This means that the "correction" to our integral takes the form 



BdA - / BdA = / BdA - / BdA (212) 

We know that the area of the square is 0(r 2 ). Therefore, the area of the two small strips we 
are integrating over is 0(r 3 ). Thus, the terms of 0(r) under the integral become 0(r 4 ) once 
the integral is evaluated. Therefore, 0(r) is the highest power we leave under the integral. 
This means that we can expand B only up to linear terms: 



OB 



dx 



OB 



dy 



B = B(Q) + — x + — y (213) 







Now, in light of the fact that B(0) is "large" while linear terms are 0(r), we will need 
different kinds of approximations to compute them. Thus, it would be best to compute them 
separately. As far as B(0) integral is concerned, we simply have to look at the variation of 
area. It is easy to see that 

Area(s 4 s 2 Si) = r 2 ; Area(s 4 , s 2 s' 1 ) = r 2 cos# (214) 

This immediately implies that 

J B(0,0)dA- J B(0,0)dA = -r 2 (l -cosfl) (215) 

*/S4S2S^ J S4S2S1 

Let us now do the linear terms. From what we have just seen, the deviation between 1 and 
cos# leads to 0(r 4 ) terms when it comes to something "finite", such as -5(0, 0). Now, in case 
of linear terms, we have extra 0(r). This means that the difference between 1 and cos 9 will 
now produce 0(r 5 ) effect, which can be thrown away. Thus, we will replace cos 9 with 1 by 
saying 

s[ = (0, r,r sin #,0) (216) 

Now, the line connecting s 2 and si is t = r — x. The line connecting s 2 and s[ is t — 
t — x(t — h)/r. The line connecting s 4 and s\ is t = x — r, and the line connecting s 4 to s[ 
is t = x(t + h)/r — r. Thus, according to the Equation 12121 the correction becomes 

/ (B(x,y) - B(0,0))dA- (B(x, y) - B(0, Q))dA = (217) 
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dx 



^ ( dx 



dB 

x + 

o dy 



y 



dx 



dB 

x + 

o dy 



When we compute that integral on the right hand side, we obtain 



(B(x,y)-B(0,0))dA- 



(B(x,y)-B(0,0))dA=^^- 



(r - h) (218) 



(B(x,y) - B(0,0))dA 

>A 

Now, by substituting 

we obtain 

(B(x,y)-B(0,0))dA- 



And, if we apply Equation 12121 to the left hand side, we obtain 



(B(x,y)-B(0,0))dA 



h = r sin 9 



(B(x,y)-B(0,0))dA=^-^- 

S4S2S1 " ^y 



hr dB 
3 dy 



(r-h) 



(sin 9 — sin^ 



(219) 
(220) 
(221) 



Now, we will add the integrals over B(0, 0) and over B(x, y) — B(0, 0) to obtain 



B(x,y)dA 



t 3 dB 

B(0, 0)dA = -r 2 (l - cosfl) + — — 

3 dy 



(sin 9 — sin z 



(222) 



Now, in order to find the "maximum" we have to differentiate it with respect to 9 and equate 
derivative with zero: 



r 3 dB 

T*B(0)sm9+ T -^- 
3 dy 



(cos 9 — 2 sin 9 cos i 







After factoring out — r 2 from the above equation we obtain 



r dB 

B(0)sin*--- 



(cos 9 — 2 sin 9 cos i 







(223) 



(224) 



We will now Taylor expand the above and solve it for 9. Let us now see to what order we 
want to do that. The only purpose of knowing the value of 9 is to substitute it into Equation 
12221 Now, we want to compute the right hand side of Equation 12221 up to 0(r 4 ). Now, 
the 0(# 2 ) term will lead to 0(r 2 # 4 ) contribution in r 2 (l — cos#) and 0(t 3 9 2 ) contribution in 
r 3 (sin 9 — sin 2 9). Since 0(9) = 0(r) these two contributions are 0(r 6 ) and 0(r 5 ) respectively. 
Since our calculation is up to 0(r 4 ) both can be neglected. This means that we can neglect 
0(9 2 ) correction to 9 and, therefore, write 



9 



t dB 



35(0) dy 

Now, we will expand Equation 12221 up to 0(r 4 ), 



B(x,y)dA 



B( x ,y)dA =-_ + __ 



(225) 



(226) 
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and substitute the value of 6 we have just found, to obtain 



B(x,y)dA - 



t 4 fdB 

B(x,y)dA = rrf^F 

, wai 185(0) \dy 



(227) 



The above equation pertains to the variation of the flux through a "triangle" S1S2S4. We are 
interested, however, in the flux through the "square" S1S2S3S4. But, one can clearly see that 
the flux through the square is equal to the sum of the fluxes through two triangles, S1S2S4 
and S3S2S4. In light of the fact that s% is the only point that is being moved, the flux through 
S3S2S4 will not change. Thus, the change of the flux through the square S1S2S3S4 is identical 
to the change of the flux through the triangle SiS 2 S4 : 



B(x,y)dA 



/ dB 

B{x,y)dA= ——— ( — 
Sl s 2S3 s 4 185(0) V dy 



(228) 



Now, the above variation was only pertaining to the displacement of S\. Now we will add 
the displacements of all four points. From symmetry it is clear that effects of displacement 
of S2, S3 and S4 can be obtained by rewriting the above equation while replacing y with — x, 
—y and +x, respectively. Since the only place where — x and — y are present is derivative, 
squaring of that derivative will turn them into +x and +y, respectively. Thus, we obtain 



B(x,y)dA - 



B(x,y)dA 



S1S2S3S4 



B(x,y)dA 



SlS' 2 S 3 S4 



-/ 

J Si! 



B(x,y)dA 



2S3S4 



B(x,y)dA - 



S1S2S0S4 



B(x,y)dA 



S1S2S3S4, 



L 



B(x,y)dA 





(dB 


185(0) 


\ dy 




fdB 


185(0) 


v dx 




fdB 


185(0) 


v dy 




fdB 


185(0) 


ydxo 



(229) 

(230) 
(231) 
(232) 



B(x,y)dA 

Finally, in order to "shift" all four points by going from {s\, S2, S3, S4} to {s[, s' 2 , s' 3 , s' A }, we 
have to sum the effects of the shifts of any individual point (and we don't have to look at 
"interaction" between the shifts since that would be of higher order than we are concerned 
about). Thus, by suming the above four equations, we obtain 



B(x,y)dA 



1 ,/ „/ „/ 

1*2*3*4 



+ 



( 



dB 



\ dy 



(233) 



where 1/18 changed to 1/9 due to the presence of two identical copies of each term. We have 
not done yet. Apart from the "correction" due to the "shift" of the contour, there is also a 
correction over "any given" contour due to the derivatives of 5, given by Equation 12071 



'S1S2S3S4 



B(x,y)dA- I B(0,0)dA 

J S1S2S3S4 



2 4 /d 2 B 

—T 



dx 2 



d 2 B 
dy 2 



0(r 6 ) 



(234) 
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Now, both of the above correction is strictly due to d x B and d y b. On the other hand, in 12021 
we have found that the correction due to d?B is 



5 2 (Flux) 



dB 



B{0) \ dz 



(235) 



Thus, if we take all of these corrections into account, we will obtain the final equation for 
flux to be 



Flux = 2B(0)t 2 + 



(dB 



95(0) Wdx 



+ 



r 4 fdB \2 2 ^d 2 B 



B(0) V dz 



-T 



dx 2 



/dB 
\dy 

d 2 B 



+ 



dy 2 



Now, the pre-Lagrangian is equal to the square of the flux, thus it is given by 



fdB 
\dy 



/dB\* 8r 6 B(0)fd 2 B 



dx 2 



d 2 B 



dy 2 



(236) 



(237) 



Let us now remind ourselves of the role that the above expression is playing. If we go back to 
scalar field, and if we don't care about the correction terms, then the pre-Lagrangian would 
be given by non-covariant expression 

J* = (do<f>) 2 (238) 

We then write the above in a covariant form by introducing a vector given by 

v% = (239) 

where the subscript <fi stands for field as opposed to coordinate index (thus, v£ has one 
coordinate index /x). We then allow v% to rotate and rewrite the above expression as 



J* = i V B d H 



(240) 



Such expression is not invariant under rotation of v% and, therefore, can not be identified 
with Lagrangian itself (which is why it is called "pre-Lagrangian" ). In order to find the actual 
Lagrangian, we need to select v'f that would minimize the absolute value of pre-Lagrangian. 
Thus, in case of scalar field, we would have 



dPjtdJ) > 



d^cj) 



seal 



d i 



(241) 



and then the substitution of the above vji, indeed, gives us the actual Lagrangian (as 
opposed to "pre-Lagrangian J^)\ 



£ = i<d^) 2 = d^d, 



(242) 



We will now use similar concept for the electromagnetic field. So we would like to select a 
t-axis in such a way that the right hand side of Equation 12371 is minimized. After that, we 
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will substitute into the "pre-Lagrangian" given in I237[ in order to obtain the actual La- 
grangian. Now, we already know from Section 6 that, up to finite order, v 1 ^ is an eigenvector 
of Ml, where 

= F" p F p u (243) 
We would now like to find higher order corrections to v^, 

v% = + 5v B (244) 

Now, in the frame where E and B are "exactly" parallel, the un-corrected expression for Jb 
is "exactly" minimized. Thus, the corrected expression is only "approximately" minimized 
in that frame. On the other hand, there is "another" frame where Lagrangian generator 
reaches "exact" minimum, but E and B are only approximately parallel. We will denote 
these two frames by x p and x ,fl , respectively. Furthermore, we will identify the unit vectors 
in t and t' directions by v B and v B , respectively: 

x ° = x ^ ; x '° = (245) 

Furthermore, we will select z-axis in such a way that, in v^-hame, it coincides with a common 
direction of E and B. If we denote F pu tensor in these two frames by F pu and F' , this means 
that 

Fu = F 23 = F 01 = F 02 = (246) 

while the above statement will not be true if we replace F with F' . Now, our task is to 
select x'^-frame in such a way that F[ 2 is minimized. 

In order to make F[ 2 distinct from F 12 we have to "mix" it with something non-zero 
that has either index 1 or index 2. Given our constraints, the only non-zero element that 
meets the above description is F 12 , itself, which prevents us from having first order variation. 
However, we can produce second-order variation. For example, we can rotate F 03 to get F" 3 
and then we can further rotate F" 3 to get F[ 2 . Another avenue is to first rotate F 12 to get 
either Fq 2 or F" 3 and then we can further rotate the latter into F[ 2 . This last option would 
account to the rotation of F\ 2 through the A^. However, all of the options we have just 
discussed will result in second order variation rather than first order (which is due to the 
fact that everything other than F03 and F\ 2 is zero). 

Now, any kind of sequence of boosts can be produced through a single boost along the 
appropriately chosen direction. This means that we have to try to minimize the Lagrangian 
generator with respect to four degrees of freedom: three degrees of freedom tell us the 
direction of the boost, and the fourth tells us the magnitude of the boost. We can proceed 
by first selecting a fixed direction and minimizing with respect to magnitude, and after 
that minimizing the produced "minimum" with respect to direction. Now, we have seen 
previously that the Lagrangian generator takes to form 

J B = A\B\ 2 T i + At 6 (247) 
Now, from what we have just said, the variation of \B\ is of the order of (5v) 2 : 

\B'\ = \B\ +a(5v) 2 (248) 
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This, however, might not be true for A. After all, the above statement for \B'\ was based 
on the assumption that F 12 and F Q3 are the only non-zero components of the tensor. On 
the other hand, the coefficients next to 0(r 6 ) terms are more complicated (for example, they 
have second derivatives, among other things); thus the "second derivatives" of things that 
were assumed to be zero are no longer zero. This means that the variation of A is of first 
order: 

A' = A + bSv (249) 
This means that the modification of Lagrangian generator is given by 

J' B = 4(151 + a(H 2 ) V + (6 + d5v)T 6 (250) 

Up to 0((Sv) 2 ) this becomes 

J' B = 4|S|V + 8\B\a(Sv) V + dr G 5v (251) 

In order to find the minimum, we have to equate the derivative of J' B with respect to 5v to 
zero: 

3 7' 

= = l6\B\ar A 5v + dr 6 (252) 

a{ov) 

which implies that 

dr 2 

5v = — V- (253) 
16151a 

In other words, we have just shown that 

8v = 0(r 2 ) (254) 

This immediately implies that 

(5v) 2 r 4 = 0(r 8 ) ; t 6 5v = 0(r 8 ) (255) 

Since we are computing up to 0(r 6 ), this means that the impact of Sv can be neglected 
altogether. Therefore, we don't need to find out the value of 5v either. We can simply stick 
to the Equation 12371 as our final expression both for Lagrangian generator as well as actual 
Lagrangian, as far as 0(r 6 ) is concerned. 

Let us now write the Equation 12371 in a covariant form. First of all, in equations 11771 
and 11781 we already found out a covariant expression for \E\ 2 , \B\ 2 and E ■ B. Thus, we can 
freely be using these three quantities. However, we can not use un-contracted B^, nor can 
we "contract" it with anything else, such as v B B^. Our task is to manipulate \B\ 2 , \E\ 2 and 
B ■ E in such a way that we will arrive at the rest of the expressions we might need. In order 
to do it, we can utilize the constraints "at the origin" 



B x (0) = B y (0) = ; v B (0) = 6% (256) 
At the same time, these constraints no longer hold away from the origin: 

dpB x ± ; d^By ^ ; d v v B (257) 
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After all, we would like our coordinate system to be "rigid" in a sense that Christoffel's 
symbols are zero. This means that if B z "twists around" it can't possibly be identified with 
z-axis at more than one point. The most we can do is to rotate z-axis so that the two 
are identified at the origin. Furthermore, we recall from the previous discussion that is 
determined based on E and B. This means that Vg likewise "twists around". So, for the 
same exact reason, can be identified with i-axis only at the origin, and not elsewhere: 

«£(o) = tf ; (&«£)(o)^o (258) 

Accordingly, at the origin, B z and \B\ coincide; but their derivatives do not: 

B Z (Q) = \B(0)\ ; d,B z (0) + 8^(0)1 (259) 

However, due to the fact that the derivatives of B 2 and By are both zero, it is easy to see 
that the derivatives of \B\ 2 and B 2 coincide: 

d z \B\ 2 = d z B 2 z = 2B z d z B z (260) 

However, as long as we take "squares" of the above quantities, their derivatives will be zero: 

(<9^)(0) = 2^(0)3^ = ; (3„Bj)(0) = 2B y {Q)d tl B y = (261) 

This also implies that 



Now by using 



d,\B\ 2 = d^Bi (262) 
B z (0) = \B(0)\ (263) 



the above equation evaluates to 

d,\B\ 2 = 2B z (0)d,B z = 2\B\d,B z (264) 

which implies that 

d,B z = ^-J- (265) 
2\B\ 

where \B\ is given by Equation 11771 Now, we are not done yet: since we don't have a 
coordinate system, we can't have un-contracted index \i. Let us discuss cases by cases how 
we get rid of it. Lets start with fi = 3; in other words, we want to produce d z B z . In light of 
the fact that B(0) points along z-axis, we know that 

B»(0)d,f = B z (0)d z f = \B(0)\d z f (266) 

This means that 

9J = -J—B»(0)d^f (267) 
B ( 



Now, in order to write a truly covariant expression, we have to write B^ in terms of F^. 
Since B is parallel to z-axis, we know that 



01 — r 02 — ^13 — -T23 



(268) 
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Therefore we can rewrite B^d^f as 

B»d„f = B 3 d 3 f = F 12 d 3 f (269) 

Now, we would like to come up with covariant expression that produces F 12 without producing 
Fqs- We will do the following trick. First we notice that v B is pointing along z direction. 
This immediately implies that ea(3'y5v% is non-zero only when 0, 7 and 5 are all non-zero 
at the same time. Thus, by contracting with any two of these three indexes we will 
immediately "get rid" of F 3 and be left with F\ 2 : 

e aPl sv a B F^d s f = -2F 12 d 3 f = -2B 3 d 3 f = -2B z d z f (270) 

where we have (+, — , — , — ) convention, and B z is identified with B 3 rather than B 3 : 

F 12 = B Z = B 3 = -B 3 (271) 

while d z is identified with d 3 : 

d z f = d 3 f (272) 
Now, by recalling that B z = \B\, we can rewrite the above expression as 

d z f = —^e a ^6V a B F^d 5 f (273) 
2(51 

Furthermore, immediately from the choice of the t-axis, we know that 

dtf = i;£0 M / (274) 

As far as d x and d y are concerned, the symmetry around z-axis prevents us from " separating" 
them from each other. But we can easily compute the sum of their squares by using 

(d x f) 2 + {d y ff = (d f) 2 - (d z f) 2 - Pfdpf (275) 

By substituting the expressions for d f and d z f the above becomes 

{d x ff + {d y ff = {v B d,ff - -L-U^sv^dtf) 2 - d^fdj (276) 

A\B\* 

Now, let us apply it to / = B z . We can use Equation 12651 

3,B Z = a -^p- (277) 
2\B\ 

to do the following substitutions: 

^ B d,B z f = (^^) 2 = ^r 2 W,\B\ 2 ) (278) 

d s \B\ 2 \* 



, , e a ^ s v a B F^d s B z ) 2 = -—- (e a/3l5 v a B F^^-J-) (279) 
4 LB 2 4 LB 2 ^ 2LBI ' 
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By combining factors of 2 as well as powers of \B\, the last equation can be further rewritten 

as 

-i-(e a ^ s v^W 5 B z ) 2 = —L^(e a p j5 v%F^d 5 \B\ 2 ) 2 (280) 
4 ig 2 ' ' 16 LB 4 

Finally, d^fd^f is now being replaced with d^B z d^B z , which can be rewritten as 

d»B z d„B z = d ^ B J d ^ B J = _^ 9 »\B\ 2 dJB\ 2 (281) 
2\B\ 2\B\ 4\B\ 2 

By substituting |278l [280] and EM] into [2751 we obtain 

dB z \* , ^\ 2 1 , u« ,^2 



+ = (282) 



lGlB^ A\B\ 

Now, our final goal is to rewrite [2371 in a covariant form. The only part of the above equation 
which we have not yet written in a covariant form is the second derivative terms. The only 
such expression present is d 2 B z + d 2 B z . Let us, therefore, find a covariant equivalent of the 
latter. First, it is easy to see that the above expression can be written as 



8lB z + d 2 B z = S-Fjd^Ff (283) 
LB 2 



The only problem is that if we will replace Latin symbols with Greek, we would obtain 
unwanted zero components. We would like to use the vector v 1 ^ in order to "get rid" of 
them. Suppose, for example, we want to write a covariant expression for \w\ 2 for some other 
vector w. In the frame in which t-axis coincides with v 1 ^, we can write it as 

\w\ 2 = ( w ) 2 - w » Wfl = ( w %) 2 - w^Wf, (284) 

Now, in light of the fact that the expression we want to evaluate is a lot longer, we want 
our calculation to look as simple as possible. Thus, we will be using do and wo in place of 
v^d^ and v^w^ and then we will replace all of the 0-s with appropriate ^-contractions at 
the very end. We will take care of one component at a time. Let us first "get rid" of k in 
12831 

d 2 x B z + d 2 B z = -^FfidlFf - jl^F %d>F° (285) 

By comparing the right hand side of the above to the right hand side of 12831 one can see 
that the number of terms changed from 1 to 2 due to the "split" of space-alone term into 
covariant term together with time-alone one. In general, each time we get rid of a "Latin" 
index, the number of terms always doubles (thus, since we have to also get rid of i and j we 
will have 8 terms at the end). Now, we will "get rid" of i and obtain 

d 2 x B z + d 2 B z = J-FfWF/ - * F^F/ - -L-Ffdt&F* + ^-Ffd^F? (286) 
\B\ \B\ B \B\ 
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where, as usual, F ° = 0, but we are keeping that term in order to convince ourselves that 
we have the right number of terms and that this number " doubles" as expected. Finally, we 
will get rid of j: 

d 2 x B z + d 2 B z = jl^F p °d d°F P - Jj-FfdoPFf - ^Ffd^Ff + - J- Ffd^Ff- 

- -i-F %d°F + -L-FfdodTF? + -i-F ^d°F ° - -l-F^d" F u ° (287) 
\B\ Z \B\ A \B\ A \B\ Z 

The above equation has 8 terms, as expected. Three of these 8 terms involve Fqq and, 
therefore, are equal to zero. We can now get rid of these 3 terms, and obtain 5-term 
expression: 



d 2 x B z + d 2 y B z = -^- 2 F p d d F p - F^FJ>- {21 



\B\ 2 p \B\ 2 p v \B\ 2 

Finally, if we substitute / = |S| 2 into Equation 12731 

d z \B\ 2 = —L ea p jS v%F^d s \B\ 2 (289) 
2\B\ 

and then substitute 12821 12881 and 12891 into 12361 an d doing some simple combining of terms, 
we obtain 

C B = 4|5| V + -^-K^l^l 2 ) 2 - -%-(e a ^v%F^d s \B\ 2 ) 2 - 
9\B\ 2 B ^ ' ' 9| j g|4 v 

r 6 8r 6 8r r 



d»\B\ 2 d^\B\ 2 + -^v a v p B d 1 d d F a dpdr<F/ -v a v B F a d p d^F p - (290) 

9|-B| 2 3|5| ' 3 1 -B | 

" ^fivlFfdadPFf + ^F^F/ - ^-v^F^FJ 

where B is given by Equation 11771 



= \ ( F^Fq i -i {s^F'n-),,,))^^ ^ ,) 2 - j^,;sl^F^ j (291) 



7 Corrections to electric Lagrangian density 

Let us now turn to the "electric" Lagrangian. Again, we are considering a flux through a 
contour bounded by four points; but there is one important difference. In the "magnetic" 
case, all four points were lying on the equator of Alexandrov set. In the electric case, on the 
other hand, two of the four points lie on the equator, and the other two points coincide with 
the poles. This also has an impact on a particular ways the contour can be deformed. In 
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magnetic case we were able to move all four points of the contour. In electric case, we can 
only move two points that lie on equator; the other pair of points is constrained to coincide 
with poles and, therefore, can't be moved. 

Now, in the magnetic case, we were making a "rigid shift" of the contour in z direction. 
This, of course, requires displacement of all four points. In the present situation, our inability 
to displace two of the points will prevent us from making such shifts. Furthermore, in the 
magnetic case we could slightly stretch or compress the length of the edges. In our present 
case, each of the edges is "constrained" to connect a point on a pole with a point on the 
equator. This ultimately constrains both its spacelike and timelike projections to coincide 
with t. As a result, the magnetic field has no contribution to flux through this contour, and 
the entire contribution comes solely from the electric field. 

Let us denote the poles of Alexandrov set by A and C, and denote the two points on 
equator by B and D. In case of the constant electric field, points B and D will be on the 
opposite sides of the equator, chosen in such a way that the line connecting them is parallel 
to electric field. If, however, we introduce a derivative of electric field with respect to some 
perpendicular direction, then points B and D will shift in the direction of the derivative. 
At the same time, A and C will stayed "glued" to the poles. This will result in rotation of 
triangles ABC and ADC by an angle 9 (which would be the same if we neglect the second 
derivatives). In light of the fact that the direction of electric field at B is parallel to the one 
in D, it is easy to see that these two triangles will rotate "towards" each other. But the 
angle of rotation will be small. 

Let us denote the square loop by ABCD, where A and C are the two poles of Alexandrov 
set and B and D are located at the equator. As usual, we will select t axis in such a way 
that it passes through A and C. Thus, 

A = (-r, 0,0,0) , C= (r, 0,0,0) (292) 

Furthermore, we will select z axis to be the direction of the electric field at the origin. Thus, 

E x = f ■ VE X ,E v = r- VE y (293) 

E z = E(0)+f- VE Z (294) 

If the derivative terms were zero, the flux would have been maximized by setting B and D on 
the z-axis. If the derivatives are non-zero, they will be shifted slightly from these locations. 
Thus, 

B = (0, -r + 0(r 2 ), 0(r 2 ), 0(r 4 )) , D = (0, r + 0(r 2 ), 0(r 2 ), 0(r 4 )) (295) 

Here, 0(r 2 ) terms correspond to the slight shift of points tht we have just discussed. The 
reason we used 0(r 4 ) rather than 0(r 2 ) in the ^-coordinate, is that we are assuming that B 
and D lie exactly on the surface of the sphere; in other words, the equation 

B 2 X + B 2 y + B\ = D 2 X + D 2 y + D\ = r 2 (296) 

holds exactly. From the point of view of causal set theory, this is enforced by an assertion 
that all links are direct; that is, there is no point E satisfying A ~< E ~< B, A -< E -< D, 
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B~<E-<CoiD~<E-<C. The combination of Equation 12961 with the fact that x- and 
y-coordinates deviate by 0(r 2 ) implies that ^-coordinate deviates by 0(r 4 ). 

The fact that these links are direct is also the reason why we have not included the 
deviation of t-components. After all, if the point B were to shift "upward" from the equator, 
then it would be "forced" to either be spacelike-separated from C or to have non-zero 
distance from B. In the former case we would get a "contradiction" with an "existence" 
of BC-\mk, while in the latter case we would get a contradiction with "direct nature" of 
v4i?-link. Similarly, if we were to shift B downwards, we would either have to violate the 
existence of AS-link or direct nature of BC-\mk. Similar argument also prevents us from 
shifting point D either upward or downward. At the same time, it should be acknowledged 
that if we were to include gravity then B and D satisfying the above conditions would pick 
up i-component due to the bending of light cone. But, as we have stated earlier, we are 
postponing gravity to [7] and in this paper we are assuming flat spacetime. The assumption 
that curvature is exactly zero corresponds to the assumption that the t-component is zero 
as well, up to the discreteness scale. 

Let us now discuss the order of magnitude up to which we will evaluate the spacelike 
0(t 2 ) displacement of these points. As we stated earlier, the unperturbed flux through the 
square loop is 0(r 2 ). Now, the displacement has three possible effects: the derivative of 
the field with respect to the direction of displacement, the angle of the projection of flux, 
and the variation of area of the displaced contour. Now, if we were dealing with triangle, 
such as ABC, it is easy to see that the displacement between the centers of triangle and 
Alexandrov set is of 0(r), leading to the variation of flux of 0(r 2 ) x 0(r) = 0(r 3 ). If, on 
the other hand, we consider the square instead of triangle, we would no longer have such 
displacement. Instead, the displacement distance would be of 0(r 2 ) leading to the variation 
of flux of 0(r 2 ) x 0(r 2 ) = 0(r 4 ). As far as the projection goes, we expect the angle of the 
tilt of the contours to be 0(r). This means that the cosine of that angle is 1 — 0(r 2 ). This 
would, again, lead to 0(r 2 ) x 0(r 2 ) variation of the flux. 

Finally, in electric case, we expect the area variation to be zero, contrary to its 0(r 2 ) 
value in magnetic case. After all, in magnetic case we had a contour "lifted" by all four 
points, while in electric case we have two triangles {ABC and ADC) rotation. It is easy to 
see that the area changes in the former case but not the latter. At the same time, the area 
of the projection of "two triangles taken together" onto yz plane will, in fact, change. But 
the effect of this variation of area is "absorbed" into the effect of the projection. In fact, one 
can re-think the "projection of the field onto tilted triangle" as "projection of the field onto 
the shaddow of tilted triangle on the original plane". If the field lines are perpendicular 
to the original plane, then any given line will pass through the tilted triangle if and only 
if it will also pass through its shadow. Thus, both fluxes get multiplied by cos# just for 
"different reasons" (in one case cos 8 comes from projection, in the other case it comes from 
area modification). 

In this respect we can re-think "projection onto triangles ABC and ADC 11 in terms of 
"change of area of projection of contour ABCD after shifting B and D" , while keeping in 
mind that said contour will no longer be square and, instead, it will be three-dimensional 
object. This new interpretation makes it more analogous to magnetic case: in magnetic case 



46 



we were dealing with "area modification" due to "shift" of all four points (A, B, C, and D) 
while in electric case we are looking at area modification due to the shift of only two of the 
four (B and D). In both cases the resulting variation of flux is 0(r 2 ) x 0(r 2 ) = 0(r 4 ), and 
in both cases source of one of 0(r 2 ) is cos# = 1 — 0(t). In electric case we are using cos# 
in order to make a projection onto our two triangles while in magnetic case we need cos 6* in 
order to compute the modification of area of shifted rectangle. 

Be it as it may, the bottom line is that the original flux is 0(r 2 ) and all of its higher order 
variations are of the order of 0(t 2 ) x 0(t 2 ) = 0(r 4 ). Now, the Lagrangian generator is the 
square of the flux. Thus, if we compute flux up to leading order, the Lagrangian generator 
will take the form (0(r 2 ) + 0(r 4 )) 2 = 0(r 4 ) + 0(r 6 ). Since this is the order of magnitude up 
to which we agreed to do our calculations, we do not need to include anything higher. In 
other words, we will compute the variation of the angle up to leading order. 

Let us now look more closely at the sources of the displacement. It is clear that linear 
variation of E z will shift the two points (B and D) in the direction where the magnitude of 
E z is larger. Finite value of dE z /dx will result in 0(r 4 ) variation to flux per the argument 
we just made; thus, the effects of second derivatives are of a higher order and can be thrown 
away. As far as E x and E y are concerned, their projections onto the original contour were 
zero (after all, our choice of coordinates implies that the original contour lies in te-plane). 
Thus, in case of rotated contour their magnitude should be multiplied by sin# = 0(0) (where 
9 is the angle of rotation away from tz-plane). Apart from that, we were also assuming that 
E(0) is parallel to z-axis. Thus, E x and E y are of the order of magnitude of shift "away 
from" the origin, which is 0(r). Their flux, therefore, is 0(r 2 ) x 0(r) x 0(r) = 0(r 4 ). We 
notice that this is the main contribution of E x and E y to flux. Thus, we do not need higher 
order terms. 

Let us now discuss the order of magnitude up to which we compute the angle. The 
value of the angle is based on the maximization of flux. Now, since we are assuming that the 
angle is small, we already know the value of flux up to 0(r 2 ), which we treat as "constant". 
Furthermore, from our prior argument we know that the leading order correction is 0(t 4 ); 
in other words, there are no 0(r 3 ) terms. Finally, we also know that 0(0) = 0(r). Thus, we 
are interested in the terms of the form 9 k r l where k + / = 4. In light of scaling symmetry, 
every single term we produce needs to have a factor of r 2 which comes from the area of the 
contour. Thus, we are interested in the terms of the form r 2+k 9 l where k + 1 — 2. Therefore, 
the most general correction that satisfies these conditions is 



In order to maximize the flux we have to equate the 0-derivative of above with zero. In other 
words, 



= d e (ar A + br 3 9 + ct 2 6 2 + 0(r 3 9 2 ) + 0(t 2 9 3 )) = br 3 + 2c9r 2 + 0(r 2 9) + 0(t 2 9 2 ) (298) 



Correction = ar 4 + br 3 9 + cr 2 9 2 + 0(t 3 9 2 ) + 0(r 2 9 3 ) 



(297) 



which implies that 



9 = -K + 0(r 2 ) 



2c 



(299) 
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Thus, in order for 0(r)-term of Equation 12991 to be non-zero, we want b to be non-zero as 
well. This means that the correction term needs to include something of the form 9t 3 . Now, 
as we said earlier, everything is to the order of 0(r 2 ) or higher due to scaling symmetry. 
This means that we are only allowed an additional first order in r. This means that we can 
assume that all fields are linear. 

Now, far as d/dt is concerned, we can split the triangle ABC into triangles AOB and 
OBC, where point O is the center of Alexandrov set which we identify as the origin. If the 
electric field was linear in t, the contributions of these two triangles would have the same 
magnitude and opposite sign. This means that the correction of the flux through ABC 
due to time derivative will be zero. In non-linear case, that correction will, of course, pick 
non-zero terms from second derivatives, but this would be of higher order than what we 
are interested in (after all, second derivatives will produce a multiple of 0(r 2 ) which would 
be multiplied by 0(r 2 ) coming from area thus leading to 0(r 4 ) which, in combination with 
^-dependence would give 9t 4 instead of 9t 3 we are looking for). We can, similarly, split the 
triangle ADC into AOD and ODC and argue in the similar way as above that d/dt does 
not lead to any flux through ADC either, up to the order of magnitude we are interested in. 
To sum it up, we are making the following assumptions: 

a) Equations 12921 and 12961 hold exactly 

b) Equation 12951 holds up to approximations given in the equation 

c) All fields are linear 

d) All fields are time-independent 

We are now ready to perform the explicit calculation. In light of symmetry, we will 
restrict our calculation to triangle ABC, and the result will be easily generalizable to ADC 
and then the two answers will be added. The only non-trivial issue we will encounter is 
the presence of r 3 term in the case of triangle and its absence in case of rectangle. This is 
due to the fact that the center of triangle is displaced by 0(r) from the origin leading to 
0(r 2 ) x 0(r) = 0(t 3 ) whereas the center of rectangle is displaced by 0(r 2 ) from the origin 
leading to 0(r 2 ) x 0(r 2 ) = 0(r 4 ) . As one would expect, these 0(r 3 ) terms will cancel upon 
addition. 

Let us consider spherical coordinates. We will denote the angle between OB and z-axis 
by 9 and we will denote the angle between OB and xz-plane by 0. We will parametrize the 
OB line by A which will go from to r. Thus, 

x = A sin 9 cos , y = A sin 9 sin , z = A cos 9 (300) 

As we said previously, 9 is assumed to be "small". At the same time, no such assumption 
is made regarding 0. In fact, we treat x and y on equal footing and, therefore, can be 
anything between and 2ir: 

9 = 0(r) , G [0,2tt) (301) 

The height of each "strip" is 2 (r — A), where the factor of 2 comes from the fact that half 
of the strip has positive t-coordinate and the other half has negative. Therefore, the area of 
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the strip is 2d\{r — A). Since we assume t-independence of fields, the flux through that strip 



is 



<i$ABC — 2d\(r — A) (E x sin 9 cos + E y sin 9 sin + ^ cos ( 



(302) 



Furthermore, by using linearity and time-independance, we know that E z , E x and E y are 
given by 



E~ 



E„ 



dz 

OR, 



dz 



X f,M dE * 

A cos 9 H — - — 

o ox 

n dE V 

A cos 9 + — ^ 
o ox 



A sin 6* cos < 



BE* 



o 



ay 



x - n j. QE V 

A sin 9 cos H — —-^ 
o ay 



A sin 6 1 sin . 



£ z = £ z (0) + 



a£ 2 



dz 



x a^ dE > 
A cos 9 H — — 

o ax 



o 

<9£ 



A sin 9 cos + 
o dy 



A sin # sin 
A sin sin < 



(303) 



o 



By substituting Equation 13031 into Equation 13021 and taking the integral we obtain 

dE x 



ABC 



d\2{r-\) ( 

'dE, 



dz 



A cos 9 + — — 

o ox 



A sin 9 cos + —-— 
o oy 



+ 



dz 



A cos 9 + — JL 
o ax 



. . _ , dE v 
A sin 9 cos H — — — 
o dy 



o 



A sin sin sin # cos 0+ 



(304) 



A sin sin ) sin sin 0+ 



+ (£*(0) + 



dz 



A cos 9 H — — 

o ax 



A sm cos H — — 
o ay 



A sin 9 sin 


j cos # 








Now, in light of the fact that integration goes from to r, it automatically adds extra 0(r) 
to everything. Furhtermore, the factor 2(r — A) ands another 0(r). Apart from that, E x 
and E y terms are multiplied by sin# which is of 0(r). Finally, we also notice that all of the 
terms, except for E z (0) cos9, come with extra factor A which is also of 0(r). This means 
that in order to do calculation up to 0(r 4 ) we have to compute E x and E y terms up to finite 
part (and replace the overall coefficients of sin# with 9), and compute i? z -terms up to 0(r), 
except for E z (0) cos9 term which will be computed up to 0(r 2 ). This means that the integral 
simplifies to 



ABC 



J dA2(r-A)( 



dE x 



dz 



A#cos0+^ 
o dz 



9 2 \ dE z 



dz 



A — 

o dx 



o 

^ dE 
A#cos0 H — — 
o dy 



X9 sin 0+ 
X9 sin c 



(305) 



By evaluating the above integral and substituting the limits of integration A = and A = r 
we obtain 

„ , , T 3 dE z r 2 9 2 
$ ABC = r 2 E z (0) + 



3 dz 



o 



-E z (0) + 



(306) 



r 3 , (dE z 



dE x 

H 

o <9z 



r 3 # . /<9£ 2 

+ — sm n^r 



o <9z 



The reason we have the r 3 term in the above expression despite our earlier remark that 
such terms drop out is that we are integrating over the triangle ABC as opposed to the 
contour ABCD. When we have stated that variation of flux is of 0(r 4 ) we were referring 
to the fact that the "displacement" of points is of 0(r 2 ), leading to 0(r 2 ) x 0(r 2 ) = 0(r 2 ). 
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However, in case of a triangle, the displacement of the center of triangle away from the center 
of Alexandrov set is of 0(r) rather than 0(r 2 ). Thus, our assumption was strictly referring 
to square loop and not to the triangle. In case of square loop, on the other hand, the r 3 
contribution will drop out, as expected. After all, dE z /dz will give positive contribution to 
flux over triangle ABC and negative contribution over ADC, leading to cancellation. By 
inspection it is easy to see that dE z /dz is the only source of r 3 which implies that the 
flux over ABCD has no 0(r 3 ) terms. We will first perform the calculation over ABC, thus 
carrying r 3 term as we go along. Then, after we are finished, we will "copy" our answer into 
the expression for flux over ADC while changing the sign of r 3 term. After that, we will 
add the two answers and cancel r 3 (see Equation I319p . We will denote the fluxes through 
respective contours by 4>abci 4>adc an d 4>abcd- 



Let us now maximize the flux 0abc given in Equation 13061 We will first assume fixed 
and maximize the above with respect to 9. This will imply that 9 is some function of 0. By 
replacing the former with the latter we will re-express flux as a function of alone. After 
that we will do maximization with respect to which will produce quantity independent of 
both of the angles. As far as ^-maximization is concerned, we have 



<9$ 
~d9 



-#T 2 £,(O) + yCOS0( 



dE z 



dx 



dE x 
_| £ 

o dz 



T 

+ — sin. 

o/ 3 



dE z 



dy 



+ 



dE n 



dz 



This implies that, at any given point that is potentially a maximum, 



9 



t cos fdE z 



dE x 
_| E. 

o dz 



rsin0 fdE z 



3E Z 



3E Z (0) V dx 

Upon substituting Equation 13081 into Equation 13061 we obtain 

t 3 8E, 



dE„ 



dz 



$ ABC = r 2 E z (0) + 



3 dz 



+ 



o 



(307) 



(30£ 



(309) 



+ 



18£ 2 (0) VV dx 



dE z 



+ 



BE* 



dz 



COS0 + 



dE z 



+ 9Ey 
o dz 



sin ■ 



o/ v dy 

We would now like to maximize the Equation 13091 with respect to 0. In order to do that, we 
have to maximize a cos d> + b sin <b, where 



dE z 



dx 



dE x 
_l 1 

o dz 



dE z 



dy 



+ 



dE„ 



dz 



(310) 



In order to save ourselves a little bit of time, we can use geometry to maximize a cos <p+b sin 0. 
Consider a triangle PQR that is rotated by the angle with respect to horizontal line, and 
suppose that point P lies on that line. Furthermore, suppose that the segment PQ is 
orthogonal to the segment QR. Finally let M and iV be the respective projections of points 
Q and R onto the said horizontal line. From simple geometry, it is easy to see that 



and, therefore, 



PM = a cos d> , MN = b sin . 



PN = a cos d> + b sin . 



(311) 
(312) 



50 



On the other hand, the orthogonality of PQ and QR implies that 



Therefore, by noticing that 

we obtain 

This implies that 



PR = Va 2 + b 2 



PN = PR cos NPR 



a cos d) + b sin d> = V a 2 + b 2 cos NPR 



max(a cos <p + b sin <p) = \/ a 2 + b 2 
By substituting Equations 13101 and 13161 into Equation I309[ we obtain 



$abc = r 2 E z (Q) 



r 3 dE 7 



3 dz 



T 

o + 18£ z (0) 



dE 7 



dx 



dE x 



dz 



+ 



dE z 



dy 



+ dEy 
o dz 



(313) 

(314) 
(315) 
(316) 

(317) 



It is easy to see from symmetry argument that 0abd takes the same form as 0abc except 
that dE z /dz term has the opposite sign: 



$adc = r 2 E z (Q) 



r 3 dE 7 



3 dz 



T 

o + 18EJ0) 



dE z 



dx 



dE x 



dz 



+ 



dE z 



dy 



dE„ 



o ^ dz 



(318) 



Finally, by adding Equation 13 171 to Equation 13 181 we obtain the flux through the loop ABCD: 

^ < <dE x 



$ABCD = 2t 2 E z (0) 



T 



9E Z (0) \ \ dx 



dE x 
+ 

o dz 


) 2 + 


(dE z 




dEy 




0/ 


V dy 


H 




dz 


S) 



(319) 



The reason the Equation 13191 lacks r 3 term is that this term comes with opposite sign in 
Equation 13171 and Equation 13181 leading to cancellation. This is, in fact, what we expect: 
after all, back in the qualitative discussion we have argued that the flux through the rectangle 
is of 0(r 2 ) + 0(r 4 ) and, therefore, lacks 0(r 3 ) term. In our argument we have assumed that 
the displacement is of 0(r 2 ). This assumption is not correct for the case of rectangle: after 
all, the center of triangle is displaced from the origin by 0(r). Thus, our argument regarding 
the "absence of 0(r 3 ) fails in the case of triangle but stands in the case of rectangle. This 
implies that the 0(r 3 ) terms coming from the two triangles cancel out. This, in fact, is what 
we have just seen. 

Now, just like in the the magnetic case, our goal is to arive at covariant expression. 
We will use the same trick as before. First, we will restore spacelike rotational covariance 
by replacing z with E/\E\. After that, we will restore Lorentz covariance by replacing 5$ 
with f M and E k with v^F^. Let us start with space covariance. For our convenience, let us 
rewrite Equation 13191 as 

$ABCD = 2T 2 E Z {0) 



T 



+ 



dE T 



dz 



+ 



dE„ 



dz 



9EM 
dE 



dE z 



+ 2- 



dx 



dx 
dE, 



+ 



dE z 



o dz 



dy 

n 9E z 
o dy 



+ 



(320) 



dEy 




o dz 


o) 
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Let us first find the rotationally-symmetric expression for (d x E z ) 2 + (d y E z ) 2 . We can first 
add and subtract (d z E z ) 2 : 

(d x E z ) 2 + (d y E z ) 2 = (d x E z ) 2 + (d y E z ) 2 + (d z E z ) 2 - (d z E z ) 2 (321) 

which we can further rewrite as 

(d x E z ) 2 + (d y E z ) 2 = d k E z d k E z - (d z E z ) 2 (322) 

Now, since we have assumed that E is parallel to z-axis, 

E k = E8l (323) 

we obtain 

„ „ „ E i E j d k E i d k E j „ E ;L EidiEi . , 

= ; ^ = -^i- ( 324 ) 

By substitution of Equation 13241 into the Equation 13221 we obtain 

{a ^ + {a ^ = EE^- { E^iy (325) 

Of course, the above expression is only rotationally covariant, but it is not Lorentz covariant 
(as evident by the presence of spacelike indexes and absence of timelike ones). We will first 
deal with spacelike rotational covariance of the rest of the terms and we will come back and 
make things Lorentz covariant after we are done. Let us, therefore, move to the rotational 
covariance of (d z E x ) 2 + (d z E y ) 2 . Similarly to what we have done before, we will add and 
subtract (d z E z ) 2 term: 

(d z E x ) 2 + (d z E y ) 2 = (d z E x ) 2 + (d z E y ) 2 + (d z E z ) 2 - (d z E z ) 2 (326) 

We can now rewrite it as 

(d z E x ) 2 + {d z E y f = d 3 E k d 3 E k - (d z E z ) 2 (327) 
After that, we can again use 

E k = ES k (328) 

to rewrite the above two terms as 

k k _ E i E j d i E k d j E k 
d * E d ^E , d z E z - && (329) 

Finally, by substituting Equation 13291 into Equation 1327} we obtain 

/0 „ N2 /on ,j E i Eid i E%E k f E i E j d i E j \ 2 . , 

(d z E x ) 2 + (d z E y ) 2 = 00 > - ( && ) (330) 

Finally, let us evaluate d x E z d z E x + d y E z d z E y . Again, we add and subtract (d z E z ) 2 . But, 
for our convenience, we will express +(d z E z ) 2 as d z E z d z E z , while leaving —(d z E z ) 2 in the 
original form: 

d x E z d z E x + d y E z d z E y = d x E z d z E x + d y E z 8 z E y + d z E z d z E z - (d z E z f (331) 
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We can now rewrite it as 

d x E z d z E x + 8 y E z d z E y = d k E 3 d 3 E k - (d z E z ) 2 (332) 

We can now use 

E k = ES k (333) 
in order to rewrite the two terms on the right hand side as 

dkE : %E »-_EE^, dA -_EE^ (334 ) 

By substituting Equation 13341 into Equation I332[ we obtain 

E i Eid k E%E<* f &EidjEi y 
d x E z d z E x + d y E z d z E y = — . . (335) 



E l E l V E k E k 

We can now substitute Equations 13251 13301 and 13351 into equation 13201 to obtain 

r 4 / E i E i diE k dnE k / E i E j d i E j \ 2 



E i E j d i E k d j E k z&BPdilP \ 2 2E i E j d k E i d j E k 2 fE i E j d i E :i \ 2 



Now, the Lagrangian generator is given by 

Je = $Lcd (337) 
By substituting Equation 13361 into Equation 13371 and evaluating it up to 0(r 6 ), we obtain 

. h u 4r 6 /E i E^d i E%E k / E i E j d i E j \ 2 , . 

E i E j d i E k d j E k /ETE'diEPy 2E i E j d k E i d j E k /EPE'diEl^ 2 



E l E l V E k E k J E l E l \ E k E k 

Finally, let us rewrite the above in covariant form. Like we were saying earlier, t-axis is 
parallel to a "vector field" t> M which is associated with the direction of "smallest variation" 
of N 1 . Thus, 

u" = 5$ (339) 

We can now rewrite E k as F k0 and subsequently rewrite the latter as v^F^ where "Latin" 
index k now turns into "Greek" index v. 

E k = F k0 — ► (340) 

We will, however, leave \E\ 2 in that seemingly non-covariant form, mainly because we already 
have an expression for E, 



\E\ 2 = l [F^F a p - {sgn{F^F^{F^F a tf - l( 6a ^ s F^F^ ) (341) 
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which we "like" better since it has no reference to ?A Thus, we can rewrite Equation 13381 as 
J E = 4r 4 \E\ 2 + ^^«^¥/ P " r V a7 ^^7 



9 V |£| 2 

\E\ 2 > \E\ 2 

VpV^FPPF^d^y 2v p v a FPPF° v d r) EPd v E r i (VpV^F^F^dpF^YV 



■))' 



\E\ 2 / \E\ 2 v \E\ 

Now we have to ask ourselves whether or not we have to rotate the axis of Alexandrov set in 
order to minimize J . The 0(r 4 ) term is already minimized with the Alexandrov set chosen 
the way it was. This means that the 0(r 4 ) term behaves like 9 2 , where 9 is the angle by 
which we tilt. On the other hand, the 0(r 6 ) term has completely different structure from 
0(t 4 ) one. So we have no reason to believe its derivative with respect to 9 is small. Thus, 
we would expect it to have large linear term in 9. Therefore, J evolves as 

Je = Jeo + a9 2 r 4 + b9r 6 (343) 

This implies that it reaches the minimum at 

= = 2a9r 4 + br 6 (344) 
d9 

Therefore 

br 2 , s 

9 = -^ ^ 
The £ will be equal to J at that specific 9. In other words, 

/ br 2 \ 2 / br 2 \ h 2 r 8 

Thus, C deviates from J Q by 0(r 8 ) term. Since our calculation is up to 0(r 6 ), we can ignore 
that term. Thus, we can blindly identify C with the value of J for the Alexandrov set that 
has not beeen rotated. Thus, our final answer is 

C E = Ar 4 \E\ 2 + 4 ^( v yv p v a FWF™d ll F<*W v E M 



9 V \ E \ 2 
VpV^FP^F^dpF^ \ 2 v a v p v p v a F'» l F av d ll F ar id v Ffi 7l 



E\ 2 > \E\ 2 

VpV^FPPF^d^F^ 2 2v p v a FP> x F< TV d v EPd v E r > (VpV^F^F^dpF^^ 2 



\E\ 2 / \E\ 2 \ \E\ 

where \E\ 2 is given by Equation 11781 



\E\ 2 = 1\ F a ?F af} - (sgn{FP»F, v )) x l(F^F aP ) 2 - -^ 1& F^F^) 2 ) (348) 
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8 Corrections to total electromagnetic Lagrangian 



Let us summarize what we have found so far. In Section 5 we have found the the first 
order expressions for "magnetic" and "electric" Lagrangians. These were defined by merely 
squaring \B\ and \E\ respectively. The latter are given by "covariant" expressions, 

\B\ = \ [^F^F af} + X -e a ^F^F^ + (sgn(F^F^))^ F^F af3 - (349) 

\E\ = \ (Jf^F^ + ±e afhS F°PFr' - (sgn( F^ v F^ v ) ) \j F a P F a p - ±e afhS F°P Fr*\ (350) 

Then, in Section 6, we have found a correction to "magnetic" Lagrangian. The "corrected" 
expression is given by 

C B = 4|5| V + -^-(v^\B\r - ^-{e aM8 v%F^d s \m 2 - 
9\B\ Z 9|i5| 4 

r 6 8r 6 8t 6 

— -d»\B\%\B\ 2 + -^-v a v B d^F p a d^F 5 p - v y B F p a d^Fj- (351) 



9\B\ 2 3\B\ 3\B\ 

o_6 8t^ 8t^ 

_ V ovlF a d d^F p + — F a d d 13 F p - — vld F p d d v F a 

13 B p a 7 ?>\B\ 3\B\ 

Finally, in Section 7, we have found a correction for "electric" Lagrangian, and the corrected 
expression was given by 

C E = At a \E\ 2 + ^( v ^P^F^F^d lx F a W v E^ 



9 V |£| 2 



\E\ 2 > \E\ 

'VpVaVlFWF^duFvr, \ 2 2 V p V a FWF al 'dqEHdvF/l /U^/Wpw^ N 2x 2 

|£|2 ) \E\2 V |£|2 y ) 



Now, if we combine the two Lagrangians by using 

C = C B - C e (353) 

we obtain 

C = Ar\\B\ 2 - \E\ 2 ) + -^- 2 (v B d,\B\r ~ ^-(e a , lS v a B F^\B\r- 

9\B\ Z 9|i5| 4 

T ° -d»\B\%\B\ 2 + ^rvy B d^d 5 F a d p dP'F/ - ^. v y B F a d p dP'F^- (354) 



9IBI 2 3\B\ H 3\B\ 
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4r 6 /v a v\v a FWF' n 'd ll F or <d v Ef h 



9 V \ E \2 
'VpV^FP^F^d^ \ 2 v a v p v p v a FWF av d ll F ar id v Ffi 7l 



+ a p ° \ m (355) 



v l^r y \F\ 

'v p v a v^FP»F av d p F vr] \ 2 2v p v a FPf 1 F au d v E^d l/ E T ' /v p v tT v r ,Fi"'F vo d ll F vr i \ 2 \ 2 



From Equations 11771 and 11781 one can easilly see that e a/ 3 1 sF a,3 F lS cancel out when one 
computes B 2 — E 2 . Thus, as one would expect, 

B 2 -E 2 = F^F^ (356) 

This allows us to rewrite the above expression as 

C = 4r 4 F^ + -^- 2 {v%d,\B\ 2 f ~ ^(^sv a B F^d s \B\ 2 ) 2 - 

~ ^k 9 ^ 9 ^ 2 + ^ v ^B^F p -d p dPFf - ^L v y B F;d^F y P- (357) 
9|i5| z i\B\ 3\B\ 

- ^vlFfd^FJ + ^.F^dadfFf - K-<d.F;d^F: 

4r 6 /v a v^v p v a FP^F (TV d IJb F ar 'd v Ep 1 
~~9~\ \eY 2 

v p v a v v FPPF™dpF^Y v a vPv p v a FP»F™d p F^d v F Pri 



E\ 2 > \E\ 

VpV^FP^F^dpF^ ^ 2 2v p v a FP^F au d 71 E^d u E 7 ^ ^fVpV^F^F^d^ \ \ 2 



£" 2 / IE 2 v ig 



Thus, the leading order term coincides with usual Lagrangian. But then highly non-trivial 
effects come up in higher order corrections. 



9. Conclusion 

In this paper we have proposed a way of defining Lagrangian on a causal set. This required us 
to find a "geometric" way of defining Lagrangians so that we would no longer have to refer to 
coordinate system. Our intention was to design a geometric constructions in such a way that 
the resulting Lagrangians match their analytic definition in the coordinate case. However, 
in light of the discrete nature of causal set theory, these geometric constructions have finite 
size. As a result, they are bound to take contributions from higher order derivatives. Yet, 
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analytic solutions as we know it are only functions of first derivatives of the fields. This 
means that the geometric definition of Lagrangian no longer matches the analytic one at 
higher order. Furthermore, it is quite clear that the specific mismatch between geometric 
and analytic solution will be different for different choices of "geometry". This allows us to 
vary non-linear effects at will by varying the geometric definition of Lagrangians. 

In many cases, non linearity is introduced under the motive of "grasping at the straws" 
in attempts of explaining a phenomena that can't be explained linearly. One example that 
comes to mind is quantum measurement theory. But, of course, there might be other, more 
"mundane" examples where non-linear effects are hypothesized without direct observations. 
While none of these issues were explored in this paper, it is conceivable that this paper 
will be useful in reframing these situations some time down the road. In particular, one 
can attempt to consider that the hypothesized non-linearities came from the "geometric" 
introduction of Lagrangian as opposed to anything extra that had been "put by hand". In 
fact, one can even use 'scientific method" by looking at different "geometrical constructions", 
investigating predicted non-linearities of each one, and seeing which of the "non-linearities" 
match what we see in the lab. this paper, however, shows a natural way of introducing 
non-linearity. 

One very hypothetical example of this is the explanation of OPERA experiment. It is 
conceivable that non-linear interaction between neutrino and background fields would alter 
the speed of the former. While we have not attempted to tackle neutrino field, it was still 
demonstrated on the example of Klein Gordon and Maxwell fields, that non-linear effects can 
be vastly different in case of different Lagrangians. This implies that it is conceivable that 
one field (such as neutrino) is affected more than other fields. But, of course, we can't make 
this claim until we actually work out the neutrino directly. In fact, it should be emphasized 
that this paper made no attempt to study neutrino, nor did it present any argument that 
the "non-linear effects" would lead to simple deviation of speed of signal propagation; in 
fact, it seems more reasonable to believe that the non linear effects are a lot more complex. 
Nevertheless, it might still be worth it to try to tackle neutrino with different geometric 
constructions before dismissing the possibility. 

Independently of points made above, I believe that the prediction of non-linear effects 
is important from the point of view of predictive power of causal set theory. Due to the 
fact that causal set does not have regular structure (such as lattice) it is very difficult to 
perform any kind of analytic calculations on a causal set, and so far most of the work has 
been numeric (with a notable exception involving an "analytic" calculation of cosmological 
constant, [9]). This paper, on the other hand, provides an example of an analytic prediction 
of causal set theory by proposing a set of differential equations that deviates from the one 
typically accepted. At the same time, however, there is a serious shortcoming. In particular, 
the deviations lead to highly non linear terms, and, therefore, none of the usual methods of 
evaluating path integral will work. This means that a serious investigation into alternative 
ways of evaluating path integrals is in order. 
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